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Description 

FIELD OF THE INVENTION 

5 [0001] This invention relates generally to the field of cytogenetics, and more particularly to the field of molecular 
cytogenetics. It concerns methods of determining the relative copy numbers of nucleic acid sequences in a subject 
cell or cell population and/or comparing the nucleic acid sequence copy numbers of substantially identical sequences 
in several cells or cell populations as a function of the location of those sequences in a reference genome. For instance, 
the methods of this invention provide the means to determine the relative number of copies of nucleic acid sequences 

10 in one or more subject genomes (for example, the DNA of one tumor cell or a number of cells from a subregion of a 
solid tumor) or portions thereof as a function of the location of those sequences in a reference genome (for example, 
a normal human -metaphase spread). Further, the invention provides methods of determining the absolute copy number 
of nucleic acid sequences in a subject cell or cell population. 

[0002] Although the examples herein concern human cells and the language is primarily directed to human concerns, 
15 the concept of this invention is applicable to genomes from any plant or animal. The genomes compared need only be 
related closely enough to have sufficient substantially identical sequences for a meaningful analysis. For example, a 
human genome and that of another primate could be compared according to the methods of this invention. 

BACKGROUND OF THE INVENTION 

20 

[0003] Chromosome abnormalities are associated with genetic disorders, degenerative diseases, and exposure to 
agents known to cause degenerative diseases, particularly cancer, German, "Studying Human Chromosomes Today, 
" American Scientist, 58: 1 82-201 (1 970); Yunis; "The Chromosomal Basis of Human Neoplasia," Science, 221 : 227-236 
(1983); and German, "Clinical Implication of Chromosome Breakage," in Genetic Damage in Man Caused by Environ- 

25 mental Agents, Berg, Ed., pgs. 65-86 (Academic Press : New York, 1979). Chromosomal abnormalities can be of several 
types, including: extra or missing individual chromosomes, extra or missing portions of a chromosome (segmental 
duplications or deletions), breaks, rings and chromosomal rearrangements, among others. Chromosomal or genetic 
rearrangements include translocations (transfer of a piece from one chromosome onto another chromosome), dicen- 
trics (chromosomes with two centromeres), inversions (reversal in polarity of a chromosomal segment), insertions, 

30 amplifications, and deletions. 

[0004] Detectable chromosomal abnormalities occur with a frequency ol one in every 250 human births. Abnormal- 
ities that involve deletions or additions of chromosomal material alter the gene balance of an organism and generally 
lead to fetal death or to serious mental and physical defects. Down syndrome can be caused by having three copies 
of chromosome 21 instead of the normal 2. This syndrome is an example of a condition caused by abnormal chromo- 

35 some number, or aneuploidy. Down syndrome can also be caused by a segmental duplication of a subregion on chro- 
mosome 21 (such as, 21q22), which can be present on chromosome 21 or on another chromosome. Edward syndrome 
(18+), Patau syndrome (13+), Turner syndrome (XO) and Kleinfelter syndrome (XXY) are among the most common 
numerical aberrations. [Epstein, The Consequences of Chromosome Imbalance: Principles, Mechanisms and Models 
(Cambridge Univ. Press 1986); Jacobs, Am. J. Epidemiol, 105: 180 (1977); and Lubs et al., Science, 169: 495 (1970).] 

40 [0005] Retinoblastoma (del 13q14), Prader-Willis syndrome (del 15q11-q13), Wilm's tumor (del 11p13) and Cri-du- 
chat syndrome (del 5p) are examples of important disease linked structural aberrations. [Nora and Fraser, Medical 
Genetics: Principles and Practice, (Lea and Febiger (1 989).] 

[0006] One of the critical endeavors in human medical research is the discovery of genetic abnormalities that are 
central to adverse health consequences. In many cases, clues to the location of specific genes and/or critical diagnostic 

45 markers come from identification of portions of the genome that are present at abnormal copy numbers. For example, 
in prenatal diagnosis, as indicated above, extra or missing copies of whole chromosomes are the most frequently 
occurring genetic lesion. In cancer, deletion or multiplication of copies of whole chromosomes or chromosomal seg- 
ments, and higher level amplifications of specific regions of the genome, are common occurrences. 
[0007] Much of such cytogenetic information has come over the last several decades from studies of chromosomes 

50 with light microscopy. For the past thirty years cytogeneticists have studied chromosomes in malignant cells to deter- 
mine sites of recurrent abnormality to glean hints to the location of critical genes. Even though cytogenetic resolution 
is limited to several megabases by the complex packing of DNA into the chromosomes, this effort has yielded crucial 
information. Among the strengths of such traditional cytogenetics is the ability to give an overview of an entire genome 
at one time, permitting recognition of structural abnormalities such as inversions and translocations, as well as dele- 

55 tions, multiplications, and amplifications of whole chromosomes or portions thereof. With the coming of cloning and 
detailed molecular analysis, recurrent translocation sites have been recognized as involved in the formation of chimeric 
genes such as the BCR-ABL fusion in chronic myelogeneous leukemia (CML); deletions have been recognized as 
frequently indicating the location ot tumor suppressor genes; and amplifications have been recognized as indicating 
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overexpressed genes. 

[0008] Conventional procedures for genetic screening and biological dosimetry involve the analysis of karyotypes. 
A karyotype is the particular chromosome complement of an individual or of a related group of individuals, as defined 
both by the number and morphology of the chromosomes usually in mitotic metaphase. It include such things as total 
chromosome number, copy number of individual chromosome types (e.g., the number of copies of chromosome X), 
and chromosomal morphology, e.g. , as measured by length, centromeric index, connectedness, or the like. Karyotypes 
are conventionally determined by chemically staining an organism's metaphase, prophase or otherwise condensed 
(for example, by premature chromosome condensation) chromosomes. Condensed chromosomes are used because, 
until recently, it has not been possible to visualize interphase chromosomes due to their dispersed condition and the 
lack of visible boundaries between them in the cell nucleus. 

[0009] A number of cytological techniques based upon chemical stains have been developed which produce longi- 
tudinal patterns on condensed chromosomes, generally referred to as bands. The banding pattern of each chromosome 
within an organism usually permits unambiguous identification of each chromosome type [Latt, "Optical Studies of 
Metaphase Chromosome Organization," Annual Review of Biophysics and Bioengineering, 5 : 1-37 (1976)]. 
[0010] Unfortunately, such conventional banding analysis requires cell culturing and preparation of high quality met- 
aphase spreads, which is time consuming and labor intensive, and frequently difficult or impossible. For example, cells 
from many tumor types are difficult to culture, and it is not clear that the cultured cells are representative of the original 
tumor cell population. Fetal cells capable of being cultured, need to be cultured for several, weeks to obtain enough 
metaphase cells for analysis. 

[001 1 ] Over the past decade , methods ol in situ hybridization have been developed that permit analysis of intact cell 
nuclei-interphase cytogenetics. Probes for chromosome centromeres, whole chromosomes, and chromosomal seg- 
ments down to the size of genes, have been developed. With the use of such probes, the presence or absence of 
specific abnormalities can be very efficiently determined; however, it is tedious to test for numerous possible abnor- 
malities or to survey to discover new regions of the genome that are altered in a disease. 

[0012] The present invention, Comparative Genomic Hybridization (CGH) [formerly called Copy Ratio Reverse Cy- 
togenetics (CRRC) among other names] provides powerful methods to overcome many of the limitations of existing 
cytogenetic techniques. When CGH is applied, for example, in the fields of tumor cytogenetics and prenatal diagnosis, 
it provides methods to determine whether there are abnormal copy numbers of nucleic acid sequences anywhere in 
the genome of a subject tumor cell or fetal cell or the genomes from representative cells from a tumor cell population 
or from a number of fetal cells, without having to prepare condensed chromosome spreads from those cells. Thus, 
cytogenetic abnormalities involving abnormal copy numbers of nucleic acid sequences, specifically amplifications and/ 
or deletions, can be found by the methods of this invention in the format of an immediate overview of an entire genome 
or portions thereof. More specifically, CGH provides methods to compare and map the frequency of nucleic acid se- 
quences from one or more subject genomes or portions thereof in relation to a reference genome. It permits the de- 
termination of the relative number of copies of nucleic acid sequences in one or more subject genomes (for example, 
those of tumor cells) as a function of the location of those sequences in a reference genome (for example, that of a 
normal human cell). 

[0013] Gene amplification is one of several mechanisms whereby cells can change phenotypic expression when 
increased amounts of specific proteins are required, for example, during development [Spradling and Mahowald, PNAS 
(USA), 77 : 1 096- 1 1 00 (1 980); Glover et at., PNAS (USA). 79: 2947-295 1 (1 982)], or during an environmental challenge 
when increased amounts of specific proteins can impart resistance to cytotoxic agents [Melera et al., J. Biol. Chem, 
255: 7024-7028 (1980); Beach and Palmiter, PNAS (USA, 78: 2110-2114 (1981)]. 

[001 4] A major limitation of Southern analysis and related conventional techniques for analysis of gene amplification 
is that only specific sites are studied leaving the vast majority of the genome unexamined. Conventional cytogenetic 
studies, on the other hand, provide a broad survey of the genome but provide little information about genes that may 
be involved in amplification events. However, the procedures of this invention overcome those limitations. This invention 
can be used to show the normal chromosomal locations of all regions of a genome that are amplified or deleted wherein 
the size of the regions that can be detected is limited only by the resolution of the microscopy used and the organization 
of DNA in condensed chromosomes. Thus, this invention provides among other uses the ability to study gene ampli- 
fications and deletions and their rotes in tumor development, progression and response to therapy more thoroughly 
than was possible previously. The methods of CGH are sufficiently rapid and simple that large numbers of subject 
nucleic acids, for example from many tumors, can be analysed in studies for gene amplification and deletion. 
[0015] The karyotypic heterogeneity in solid tumors can be extreme. Identification of commonly occurring chromo- 
somal changes by analysis of metaphase spreads is often difficult or impossible using conventional banding analysis 
because of the complexity of the rearrangements and because of the poor quality of the metaphase preparations. CGH 
overcomes that limitation in that the tumor nucleic acid can be studied without the requirement of preparing metaphase 
spreads. Since CGH can probably be performed on single cells by amplifying the nucleic acid therefrom, CGH can be 
used to investigate the heterogeneity of tumors by studying representative cells from different cell populations of the 
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tumor. Alternatively, CGH of nucleic acid from a tumor extracted in a bulk extraction process from many cells of the 
tumor can reveal consistencies within the apparent heterogeneity. For example, the same amplified sequences may 
appear as homogeneously staining regions (HSRs) and/or double minute chromosomes (DMs) in one tumor cell but 
as an extension of achromosome arm in anothertumorcell. Thus, order from the apparent randomness may be realized 
by the use of CGH. 

[0016] Montgomery et al., PNAS (USA), 80 : 5724-5728 (September 1983), concerns the hybridization of labeled Cot 
fractionated DNAs from tumor cell lines (a Cot fraction from which the high copy repeats, low copy repeats and single 
copy sequences were substantially removed) to metaphase spreads from said tumor cell tines. Basically, Montgomery 
et al. mapped the positions of nucleic acid sequences from tumor cell lines that are very highly amplified back to tumor 
cell tine genomes. 

[0017] Total genomic DNA from one species has been used in in situ hybridization to discriminate in hybrid cells 
between chromosomes of that species and of a different species on the basis of the signal from the high copy repetitive 
sequences. [Pinkel et al., PNAS (USA), 83 : 2934 (1 986); Manuelidis, Hum. Genet., 71 : 288 (1 985); and Durnam et al., 
Somatic Cell Molec. Genet., 11: 571 (1985).] 

[001 8] Landegent et al. , Hum. Genet., 77 : 366-370 (1 987), eliminated highly repetitive sequences, like Alu and Kpn 
fragments/from whole cosmid cloned genomic sequences by blocking the highly repetitive sequences with Cot-1 DNA. 
The resulting probe was used for in situ hybridization. 

[0019] European Patent Application Publication No. 430,402 (published June 5, 1 991 ) describes methods and com- 
positions for chromosome-specific painting, that is, methods and compositions for staining chromosomes based upon 
nucleic acid sequence employing high complexity nucleic acid probes. In general in the chromosome-specific painting 
methods, repetitive sequences not specific to the targeted nucleic acid sequences are removed from the hybridization 
mixture and/or their hybridization capacity disabled, often by blocking with unlabeled genomic DNA or with DNA en- 
riched for high copy repetitive sequences as is Cot-1 [commercially available from Bethesda Research Laboratory, 
Gaithersburg, MD (USA)]. Pinkel et al., PNAS (USA), 85: 9138-9142 (1988) also describes aspects of chromosome- 
specific painting as well as International Publication No. WO 90/05789 (published May 31 1 1990 entitled " In Situ Sup- 
pression Hybridization and Uses Therefor"). 

[0020] Chromosome-specific repeat sequence probes and chromosome-specific painting probes can be hybridized 
in situ to interphase nuclei as well as metaphase spreads and provide information about the genetic state of the indi- 
vidual targeted genomes. A limitation of such hybridizations is that cytogenetic information is only provided from the 
regions to which the probes bind. Such hybridizations are very useful for determining if a particular abnormality is 
present, for example, the deletion of a specific gene or a duplication among other abnormalities, but it is laborious to 
search for currently unknown abnormalities on a region by region basis. 

[0021] Other methods of searching for unknown genetic abnormalities similarly require a lot of work. For example, 
looking for loss of heterozygosity in tumor cells, requires the hybridization Of many probes to Southern blots of tumor 
and normal cell DNA. CGH provides methods to overcome many of the limitations of the existing cytogenetic techniques. 
[0022] Saint-Ruf et al., Genes, Chromosomes & Cancer, 2: 18-26 (1990) concluded from their studies of breast 
cancer that although amplification of genetic material is a frequent and probably important event in breast carcinogen- 
esis, that the relevant genes involved in such amplifications remain unknown but do not seem to correspond to the 
proto-oncogenes commonly considered important in breast cancer. 

[0023] Since HSRs in tumors are most often not at the site where the unamplified gene is in normal cells, standard 
cytogenetics does not yield any information that could assist with identification of the gene(s). CGH on the other hand 
permits mapping them in the normal genome, a major step towards their identification. 

[0024] Dutrillaux et al. : Cancer Genet. Cytoqenet, 49 : 203-217 (1990) report (at page 203) that "[although human 
breast carcinomas are among the most frequent malignant tumors, cytogenetic data remain scarce, probably because 
of their great variability and of the frequent difficulty of their analysis." In their study of "30 cases with relatively simple 
karyotypes to determine which anomalies occur the most frequently and, in particular, early during tumor progression" 
(p. 203), they concluded that "trisomy 1 q and monosomy 1 6q are early chromosomal changes in breast cancer, whereas 
other deletions and gain of 8q are clearly secondary events." [ Abstract , p. 203 ] Dutrillaux et at. further state (at page 
216) that deletions within tumor suppressor genes "characterize tumor progression of breast cancer." 
[0025] It is believed that many solid tumors, such as breast cancer, progress from initiation to metastasis through 
the accumulation of several genetic aberrations. [Smith et al., Breast Cancer Res. Treat., 18 Suppl. 1 : S 5-14 (1991); 
van de Vijver and Nusse, Biochim. Biophys. Acta, 1072: 33-50 (1 991 ); Sato et al. , Cancer Res., 50: 71 84-71 89 (1 990).] 
Such genetic aberrations, as they accumulate, may confer proliferative advantages, genetic instability and the attendant 
ability to evolve drug resistance rapidly, and enhanced angiogenesis : proteolysis and metastasis. The genetic aberra- 
tions may affect either recessive "tumor suppressor genes" or dominantty acting oncogenes. Deletions and recombi- 
nation leading to loss of heterozygosity (LOH) are believed to play a major role in tumor progression by uncovering 
mutated tumor suppressor alleles. 

[0026] Dominantly acting genes associated with human solid tumors typically exert their effect by overexpression or 
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altered expression. Gene amplification is a common mechanism leading to upregulation of gene expression. [Stark et 
aL, Cell, 75 : 901-908 (1989).] Evidence from cytogenetic studies indicates that significant amplification occurs in over 
50% of human breast cancers. [Saint-Ruf et aL, supra .) A variety of oncogenes have been found to be amplified in 
human malignancies. Examples of the amplification of cellular oncogenes in human tumors is shown in Table 1 below. 

5 

TABLE 1 



Amplified Gene 


Tumor 


Degree of Amplification 


DM or HSR Present 


c-myc 


Promyelocytic leukemia cell line, HL60 


20x 


+ 




Small-cell lung carcinoma cell lines 


5-30x 


7 


N-myc 


Primary neuroblastomas (stages III and IV) 


5-1000X 


+ 




and neuroblastoma cell lines 








Retinoblastoma cell line and primary tumors 


10-200X 


+ 




Small-cell lung carcinoma cell lines and . 


50x 


+ 




tumors 






L-myc 


Small-cell lung carcinoma cell lines and 


10-20X 


? 




tumors 






c-myb 


Acute myeloid leukemia 


5-1 Ox 


? 




Colon carcinoma cell lines 


10x 


? 


c-erbB 


Epidermoid carcinoma cell 


30x 


? 




Primary gliomas 




? 


c-K-ras-2 


Primary carcinomas of lung, colon, bladder, 


4-20x 


7 




and rectum 






N-ras 


Mammary carcinoma cell line 


5-1 Ox 


7 



SOURCE: modified from Varmus, Ann. Rev. Genetics, 18 : 553-612 (1984) [cited in Watson et al., Molecular Biology 
of the Gene (4th ed.; Benjamin/Cummings Publishing Co. 1987)] 

[0027] Chromosomal deletions involving tumor suppressor genes may play an important role in the development 

30 and progression of solid tumors. The retinoblastoma tumor suppressor gene (Rb-1), located in chromosome 13q14, 
is the most extensively characterized tumor suppressor gene [Friend et ai., Nature, 323 : 643 (1 986); Lee et al. , Science, 
235 : 1 394 (1 987); Fung et al., Science, 236 : 1 657 (1 987)]. The Rb-1 gene product, a 1 05 kDa nuclear phosphoprotein, 
apparently plays an important role in cell cycle regulation [Lee et al., supra (1987); Howe et al. ; PNAS (USA), 87: 5883 
(1 990)]. Altered or lost expression of the Rb protein is caused by inactivation of both gene alleles either through a point 

35 mutation or a chromosomal deletion. Rb-1 gene alterations have been found to be present not only in retinoblastomas 
[Friend et al., supra (1986); Lee et al., supra (1987); Fung et al., supra (1987)] but also in other malignancies such as 
osteosarcomas [Friend et al., supra (1986)] : small cell lung cancer [Hensel et al., Cancer Res., 50: 3067 (1990); Ryg- 
aard et al., Cancer Res., 50: 5312 (1 990)] and breast cancer [Lee et al., Science, 241 : 218(1 988); TAng et al. , Science, 
242: 263 (1988); Varley et al., Oncogene, 4 : 725 (1989)]. Restriction fragment length polymorphism (RFLP) studies 

40 have indicated that such tumor types have frequently tost heterozygosity at 13q suggesting that one of the Rb-1 gene 
alleles has been lost due to a gross chromosomal deletion [Bowcock et aL, Am. J. Hum. Genet., 46 : 12 (1990)]. 
[0028] The deletion of the short arm of chromosome 3 has been associated with several cancers, for example, small 
cell lung cancer, renal and ovarian cancers; it has been postulated that one or more putative tumor suppressor genes 
is or are located in the p region of chromosome 3 (ch. 3p) [Minna et aL, Symposia on Quantitative Biology , Vol. L1 : 

45 843-853 (SCH Lab 1986); Cohen et aL, N. Eng. J. Med., 301 : 592-595 (1979); Bergerham et at., Cancer Res., 49 : 
1 390-1 396 (1 989); Whang-Peng et al., Can. Genet. CytogeneL, 11 : 91 -1 06 (1 984; and Trent et aL, Can. Genet. Cy- 
togenet., 14 : 153-161 (1985)]. 

[0029] The above-indicated collection of amplified and deleted genes is far from complete. As the Saint-Ruf et al. 
study ( supra ) of oncogene amplification in cells showing cytogenetic evidence of amplification, such as double minutes 
50 (DMs) or homogeneously staining regions (HSRs), indicated, the amplified genes were not known oncogenes in most 
cases. As Dutrillaux et al. , supra indicated, "cytogenetic data remains scarce" for "the most frequent malignant tumors"-- 
breast carcinomas. 

[0030] Discovery of genetic changes involved in the development of solid tumors has proven difficult. Karyotyping 
is impeded by the low yield of high quality metaphases and the complex nature of chromosomal changes [Teyssier, J. 
55 R., Cancer Genet. Cytogenet., 37: 1 03 (1 989)]. Although molecular genetic studies of isolated tumor DNA have been 
more successful and permitted detection of common regions of allelic loss, mutation or amplification [Fearon et aL, 
Cell. 61 : 759 (1990); Sato et al., Cancer Res., 50 : 7184 (1990); Atitalo et al., Adv. Cancer Res. ; 47 : 235 (1986); and 
Schwab and Amler, Genes Chrom. Cancer., 1 : 181 (1 990)], such molecular methods are highly focused, targeting one 
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specific gene or chromosome region at a time, and leaving the majority of the genome unexamined. 
[0031] Thus, a research tool leading to the identification of amplified and deleted genes and providing more cytoge- 
netic data regarding tumors, especially tumor progression and invasiveness is needed in tumor cytogenetics. CGH 
provides such a molecular cytogenetic research tool. 

[0032] The ability to survey the whole genome in a single hybridization is a distinct advantage over allelic loss studies 
by restriction fragment length polymorphism (RFLP) that target only one locus at a time. RFLP is also restricted by the 
availability and informativeness of polymorphic probes. 

[0033] CGH facilitates the genetic analysis of tumors in that it provides a copy number karyotype of the entire genome 
in a single step. Regions of tumor DNA gain and loss are mapped directly onto normal chromosomes. Comparisons 
of primary tumors with their metastases by CGH should be informative concerning cancer progression. Analogously, 
other genomes other than those of tumors can be studied by CGH. 

SUMMARY OF THE INVENTION 

[0034] Comparative Genomic Hybridization (CGH) employs the kinetics of in situ hybridization to compare the copy 
numbers of different DNA or RNA sequences from a sample. In accordance with the present invention, the copy num- 
bers of different DNA or RNA sequences in one ceil or cell population are compared to the copy numbers of the 
substantially identical sequences in another cell or cell population. The comparisons can be qualitative or quantitative. 
Procedures are described that permit determination of the absolute copy numbers of DNA sequences throughout the 
genome of a cell or cell population if the absolute copy number is known or determined for one or several sequences. 
The different sequences are discriminated from each other by the different locations of their binding sites when hybrid- 
ized to a reference genome, usually metaphase chromosomes but in certain cases interphase nuclei. The copy number 
information originates from comparisons of the intensities of the hybridization signals among the different locations on 
the reference genome. 

[0035] As illustrated herein, genomic DNAs from two or more subject cells or cell populations are isolated, differen- 
tially labeled, and hybridized to reference chromosomes, usually in metaphase. 

[0036] The CGH methods of this invention can be qualitative and/or quantitative. A particular utility of CGH is for 
analysing DNA sequences from clinical specimens including tumor and fetal tissues. 

[0037] An important utility of CGH is to find regions in normal genomes which when altered in sequence copy number 
contribute to disease, as for example, cancer or birth defects. For example, regions at elevated copy number may 
contain oncogenes, and regions present at decreased copy number may contain tumor suppressor genes. 
[0038] Exemplary methods are those wherein the subject nucleic acids are DNA sequences from a subject cell or 
cell population. Analogous methods may be performed wherein the subject nucleic acids are RNA. Such an exemplary 
method is that for comparing copy numbers of different DNA sequences in one subject eel! or cell population relative 
to copy numbers of substantially identical sequences in another cell or cell population, said method comprising the 
steps of: 

a) extracting the DNA from both of the subject cells or cell populations; 

b) amplifying said extracted subject DNAs : if necessary; 

c) differentially labeling the subject DNAs; 

d) hybridizing said differentially labeled subject DNAs in situ to reference metaphase chromosomes after substan- 
tially removing from the labeled DNAs those repetitive sequences that could bind to multiple loci in the reference 
metaphase chromosomes, and/or after blocking the binding sites for those repetitive sequences in the reference 
metaphase chromosomes by prehybridization with appropriate blocking nucleic acids, and/or blocking those re- 
petitive sequences in the labeled DNA by prehybridization with appropriate blocking nucleic acid sequences, and/ 
or including such blocking nucleic acid sequences for said repetitive sequences during said hybridization; 

e) rendering the bound, differentially labeled DNA sequences visualizabte, if necessary; 

f) observing and/or measuring the intensities of the signals from each subject DNA, and the relative intensities, as 
a function of position along the reference metaphase chromosomes; and 

g) comparing the relative intensities among different locations along the reference metaphase chromosomes 
wherein the greater the intensity of the signal at a location due to one subject DNA relative to the intensity of the 
signal due to the other subject DNA at that location, the greater the copy number of the sequence that binds at 
that location in the first subject cell or celt population relative to the copy number of the substantially identical 
sequence in the second subject cell or cell population that binds at that location. 

[0039] Further disclosed are methods of quantitatively comparing copy numbers of different DNA sequences in one 
subject cell or cell population relative to copy numbers of substantially identical sequences in another subject cell or 
cell population. A representative method is that comprising steps (a) through (e) of the method immediately detailed 
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above and the following steps of: 

f. measuring the intensities of the signals from each of the bound subject DNAs and calculating the ratio of the 
intensities as a function of position along the reference metaphase chromosomes to form a ratio profile; and 

g. quantitatively comparing the ratio profile among different locations along the reference metaphase chromo- 
somes : said ratio profile at each location being proportional to the ratio of the copy number of the DNA sequence 
that bind to that location in the first subject cell or cell population to the copy number of substantially identical 
sequences in the second cell or cell population. 

[0040] Said representative methods can further comprise comparing copy numbers of different DNA sequences in 
more than two subject DNAs wherein the comparing is done pairwise between the signals from each subject DNA. 
[0041 J This invention further discloses methods to determine the ratio of copy numbers of different DNA sequences 
in one subject cell or cell population to copy numbers of substantially identical sequences in another cell or cell pop- 
ulation wherein the steps of (a) through (f) as described above are performed as well as the following steps: 

g. determining the average copy number of a calibration sequence in both subject cells or cell populations, said 
calibration sequence being substantially identical to a single copy sequence in the reference metaphase cells; and 

h. normalizing the ratio profile calculated in (f) so that at the calibration position, the ratio profile is equal to the 
ratio of the average copy numbers determined in (g), the normalized ratio profile at any other location along the 
reference metaphase chromosomes thereby giving the ratio of the copy numbers of the DNA sequences in the 
two subject DNAs that bind at that location. That method can be extended to further subject nucleic acids as for 
example determining tha ratio of copy numbers of DNA sequences in more than two subject DNAs wherein the 
comparing is done pairwise between signals from each subject DNA. 

[0042] Further disclosed are methods for comparing copy numbers of different DNA sequences in a test cell or cell 
population, said method comprising applying steps (a) through (e) of the above-described methods and 

f. observing and/or measuring the intensities of the signal from each subject DNA, and the relative intensities, as 
a function of position along the reference metaphase chromosomes wherein one of the subject cells or cell pop- 
ulations is the test cell or ceil population and the other is a normal cell or cell population; and 
(g) comparing the relative intensities among different locations along the reference metaphase chromosomes, 
wherein the greater the relative intensity at a location, the greater the copy number of the sequence in the test cell 
or cell population that binds to that location, except for sex chromosomes where the comparison needs to take 
into account the differences in copy numbers of sequences in the sex chromosomes in relation to those on the 
autosomes in the normal subject cell or ceil population. 

[0043] A related representative method is that for comparing the copy number of different DNA sequences in a test 
cell or cell population comprising applying steps (a) through (e) of the above described methods wherein one of the 
subject cells or cell populations is the test cell or cell population, and the other is a standard cell or cell population 
wherein the copy numbers of the DNA sequences that bind to different positions on the reference metaphase chromo- 
somes is known and steps: 

f. measuring the intensities of the signals from each of the bound subject DNAs and calculating the ratio of inten- 
sities as a function of position along the reference metaphase chromosomes to form a ratio profile; 

g. adjusting the ratio profile at each location along the reference metaphase chromosomes by multiplying the ratio 
profile by the known copy number of DNA sequences in the standard cell or cell population that bind there: and 

h. comparing the adjusted ratio profiles at different locations along the reference metaphase chromosomes wherein 
the greater the adjusted ratio profile at a location, the greater the copy number of the DNA sequence in the test 
cell or cell population that binds there. 

[0044] Another related representative method is that for determining the ratios of the copy numbers of different DNA 
sequences in a test cell or cell population, said method comprising applying steps (a) through (f) of the immediately 
above-described method and the steps of adjusting the ratio profile at each location along the reference metaphase 
chromosomes by multiplying the ratio profile by the known copy number of sequences that bind there; and calculating 
the ratio of the copy number of a DNA sequence in the test cell or cell population that binds to one location on the 
reference metaphase chromosomes to the copy number of a sequence that binds to another location by dividing the 
adjusted ratio profile at the location of the first sequence by that at the location of the second. Said representative 
method can be extended to determine the copy number of different DNA sequences in a test cell or cell population 
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wherein steps (a) through (f) as described above are followed and then the following steps of adjusting the ratio profile 
at each location along the reference metaphase chromosomes by multiplying the ratio profile by the known copy number 
of DNA sequences in the standard cell or cell population that bind there; 

determining the copy number of a calibration sequence in the test cell or cell population that is substantially identical 
to a single copy sequence in the reference cells; and 

normalizing the adjusted ratio profile so that at the location of the calibration sequence on the reference metaphase 
chromosomes, the normalized, adjusted ratio profile is equal to the copy number of the calibration sequence de- 
termined in the above step, the value of the normalized, adjusted ratio profile at another location then being equal 
to the copy number of the DNA sequence in the test cell or cell population that binds at that location. That method- 
can be analogously performed wherein two or more calibration sequences are used, and the adjusted ratio profile 
is normalized to get the best fit to the copy numbers of the ensemble of calibration sequences. Preferably, the 
copy number of the calibration sequence is determined by in situ hybridization. Those methods can comprise in 
situ hybridizing probes for more than one calibration position and normalizing to obtain the best fit of the ratio 
profile to the calibration positions. The standard cell or cell population preferably have normal genomes. In many 
applications of CGH, the reference metaphase chromosomes are normal. 

[0045] Further, this invention concerns the use of antenna cell lines. An exemplary method is for detecting amplifi- 
cation of a certain sequence or group of sequences in a subject cell or cell population, comprising essentially steps 
(a) through (e) of the above-described methods wherein the in situ hybridization is targeted to antenna cells in which 
the DNA sequence(s) to be tested for is or are amplified, and examining the reference cell for regions that are hybridized 
significantly more intensely than others, the presence of such regions indicating amplifications of the sequence(s) 
which are being tested. The chromosomes of said antenna cell lines may be in interphase or in metaphase. 
[0046] The two or more labeled subject nucleic acids can be hybridized in situ to the reference genome sequentially 
or simultaneously. Simultaneous in situ hybridization is preferred in that saturation of the targeted binding sites in the 
reference genome will not interfere with the procedure. When sequential in situ hybridization is used : it must be per- 
formed under conditions wherein the individual hybridizations are stopped well before the binding sites on the reference 
chromosomes are saturated. 

[0047] Objects of this invention are to detect sequence copy number imbalances throughout an entire genome in 
one hybridization, to map gains and/or losses of sequences in a genome, and/or to provide a copy number karyotype 
of a subject genome. 

[0048] Further, an object of this invention is to enable the detection of relative copy number differences that are 
common to a number of different cells and/or celt populations. For example, CGH methods can be used wherein DNAs 
extracted from cells of many different tumors are combined and labeled; the hybridization of those combined labeled 
DNAs to normal condensed chromosomes, provides for the rapid identification of only those copy number changes 
that occurred in most of the tumors. Less frequently occurring variations would be averaged out. Thus, this invention 
further provides for a CGH method wherein two or more of the subject nucleic acids that were extracted from different 
cells and/or from numbers of cells from different cell populations, are labeled the same, and hybridized to a reference 
spread under conditions wherein repetitive sequences are removed and/or suppressed and wherein sequence copy 
number differences that are common in said combined labeled nucleic acid sequences are determined. 
[0049] Another object of this invention is to provide the means of cytogenetically analysing archived chromosomal 
material, that is, fixed material from, for example, biopsied tissue specimens, preferably cataloged and keyed to medical 
records of patients from whom the specimens were taken, and archaeological chromosomal material. Such chromo- 
somal material cannot, of course, be karyotyped according to traditional means in that no live cells are present to 
culture and from which to prepare chromosomal spreads. However, the nucleic acid can be extracted therefrom and 
amplified by a polymerase chain reaction (PCR) procedure or by a non-PCR procedure and tested by the methods of 
this invention. 

[0050] This invention further provides for a method to detect simultaneously an ensemble of amplifications and/or 
deletions in a tumor wherein the results can be used to determine the subsequent behaviorof that tumor. Said deter- 
mination is made by associating the patterns of amplifications and/or deletions in tumor cells with the behaviorof that 
tumor. Such associations can be made by testing, for example, as indicated immediately above, DNA from archived 
tumor tissue keyed to medical records, or when fresh tumor specimens are tested by CGH and the patients are followed. 
Further, such associations can be made with CGH methods wherein there are more than one subject cell and/or cell 
population, for example, one or more tumors. 

[0051] Another object of this invention is to provide a method of analyzing cells from a suspected lesion at an early 
stage of development. An advantage of the methods of this invention is that only a few cells are necessary for the 
analysis. The early detection of amplifications and/or deletions in cells from a lesion allow for early therapeutic inter- 
vention that can be tailored to the extent of, lor example, invasiveness known to be associated with such genetic 
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rearrangements- Further, such earfy detection provides a means to associate the progression of the cells with the 
genetic rearrangements therein detected by the methods of this invention. 

[0052] Tumors can be karyotypically heterogeneous containing therein various populations of cells each having dif- 
ferent types of genetic rearrangements. As indicated above tumor cells are difficult to culture, and it is not clear that 
cultured cells are representative of the original tumor cell population. This invention provides the means to by-pass 
the culturing obstacle and allows genetic characterization of tumor cells and thus, of the heterogeneity of tumors by 
testing cells from different subregions thereof according to the methods of this invention. Bulk extraction of the nucleic 
acid from many cells of a tumor can also be used to test for consistent amplifications and/or deletions within a tumor. 
[0053] It is another object of this invention to provide methods of detecting amplifications and/or deletions of nucleic 
acid sequences wherein certain cell lines termed herein "antenna cell lines", are used to enhance the sensitivity of the 
detection. 

[0054] It is still further an object of this invention to provide methods -of prenatal or perinatal analysis wherein the 
nucleic acid of the child's cells is extracted and tested according to the methods of this invention. In one embodiment 
of CGH, such material is human and hybridized to a normal human metaphase spread to detect whether any deletions 
and/or amplifications are therein present, for example, an extra copy of chromosome 21 , diagnostic for Down syndrome. 

BRIEF DESCRIPTION OF THE FIGURES 

[0055] figure 1 schematically illustrates the general approach used in performing the methods of this invention- 
Comparative Genomic Hybridization (CGH). The reference chromosome spread is hybridized with various nucleic acid 
mixtures, either simultaneously or at different times, to obtain the desired information. Representative mixtures could 
include unlabeled sequences-designed to block sequences in the various other nucleic acid pools, for example, the 
high-copy repetitive sequences in human genomic DNA; unlabeled competitor nucleic acid to prevent saturation of the 
target sites for the labeled mixtures, for example, human genomic DNA within a factor of 10 of the concentration used 
for the labeled subject nucleic acids (see Figure 4); and one or more pools of sequences of different origin that are 
differently labeled so tha| their binding can be independently assessed, for example, tumor and normal genomic DNA 
(see Figures 5 and 6). The information on the sequence frequency of the labeled pools is obtained by analysis of the 
intensity of the individual signals and/or the differences in ratios of intensities among the signals as a function of position 
along the reference chromosomes. 

[0056] Figure 2 outlines general aspects of the CGH procedure used in Example 1 , infra. The reference chromosome 
spread, in this example normal human chromosomes, is first hybridized for about one hour with a high concentration 
of unlabeled human genomic DNA (Figure 2A). That prehybridization blocks many of the high copy repetitive sequences 
in the chromosomes so that the high copy repetitive sequences in the labeled subject nucleic acid, in this case labeled 
tumor.DNA, will not substantially contribute to the signal during the subsequent hybridization. The labeled tumor DNA, 
and perhaps some competitor DNA or other comparison nucleic acid are then hybridized to the target reference spread 
(Figure 2B). Cot-1 DNA can be included in the hybridization as in Example 1, below to block more effectively the 
centromeric repetitive sequences in the labeled subject nucleic acids. 

[0057] Figure 2 is representative of one way of reducing signals from repetitive sequences. Other methods are de- 
tailed herein infra . In each of the CGH methods including the procedures outlined in the rest of the figures, some means 
of reducing the signal from the repetitive sequences is used, but not specifically indicated in the figures. It is important 
for CGH that the signal from each subject nucleic acid be dominated by sequences that bind to well-defined loci. Total 
suppression of the signal from the genomic repeats is not necessary, but the poorer the suppression, the less able the 
procedure is to detect small differences in sequence frequency. 

[0058] Figure 3 further illustrates the procedure used in Example 1 . As shown in Figure 3A, labeled human tumor 
DNA is hybridized to a normal human chromosome spread. [As indicated in the description for Figure 2, provisions 
were made to suppress the signal from the repetitive sequences although the provisions are not specifically indicated 
in the figure. Example 1 details a preferred method to suppress the hybridization signals from repetitive sequences.] 
In this representative example, the tumor DNA is assumed to contain a region wherein some sequences are highly 
amplified, for example, an amplicon containing an oncogene. The amplified sequences in the tumor DNA may be 
clustered and integrated in some tumor chromosomes ; they may be integrated into multiple places in the tumo r genome; 
or, they may exist as extra-chromosomal elements. The sequences of the amplicon will map to some chromosomal 
location in the reference genome, which in this case is a normal human genome. 

[0059] Figure 3B illustrates the kinetics of the build-up of the signal on a target reference chromosome. The signal 
builds more rapidly in the amplified region since more copies of those sequences are available for hybridization. If the 
reaction is stopped before the target chromosome is saturated; or if insufficient labeled DNA is added to achieve 
saturation, then the genomic region that was amplified in the tumor will appear higher in intensity on the normal chro- 
mosome as illustrated by the band with the denser shading on the left reference chromosome. The more intensely 
labeled region (band with the denser shading) indicates the location and extent of the amplicon as reflected in the 
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reference genome. Thus, the amplification is detected without prior knowledge of its existence, and the origin of the 
amplified sequences is mapped in the normal human genome. 

[0060] If the reaction illustrated in Figure 3 is allowed to proceed to saturation of the target sites, contrast is lost, as 
shown by the representative reference chromosome on the right wherein the amplicon cannot be distinguished Thus, 
5 in this embodiment of CGH, it is important to stop the hybridization before saturation of the target or provide insufficient 
probe for saturation. The graphs schematically show the build-up of the hybridization signal in the region that was 
amplified (graph on right) and in the remainder that was unamplified (graph on left). The arrows connect the chromo- 
somal regions with the times of observation on the kinetic curve. 

[0061] Figure 4 illustrates an embodiment of CGH that avoids the potential saturation of the target as shown in the 
10 right portion of Figure 3B. In this representative example, the reference nucleic acid is a human chromosome spread; 
the subject nucleic acid is labeled tumor DNA (4A). If unlabeled human genomic DNA is included with the labeled tumor 
DNA in excess, in this case at a five-fold higher concentration than that of the labeled tumor DNA, then any saturation- 
of the target will be due to a combination of labeled and unlabeled copies of the nucleic acid sequences, rather than 
just labeled copies as shown in the right portion of Figure 3B. [Once again, as indicated in Figures 2 and 3 the means 
*s of reducing the signal from repetitive sequences is not indicated in this figure, but it is assumed that a protocol is 
performed to remove substantially the repetitive sequences that would bind to multiple loci in the reference genome 
and/or to block such sequences from binding to the target J 

[0062] At the early stages of the reaction, the amplified region will build up faster than elsewhere in the chromosome 
(for example if the sequence is amplified five-fold, it would build up 5 times as fast) and will be detectable as in the (eft 

20 portion of Figure 3B. However as the reaction proceeds to saturation, the unamplified regions of the chromosome 
reach only one-fifth (1/5) of the intensity shown in the right portion of Figure 3B, because most of the sites are filled 
by unlabeled copies of the sequences. On the other hand, a sequence that was amplified five-fold in the tumor would 
reach one-half (1/2) of the saturation intensity since an equal number of labeled and unlabeled copies of those se- 
quences are present. Thus, contrast is maintained according to this embodiment at all stages of the reaction (as shown 

25 in Figure 4B), although it changes as the reaction proceeds. 

[0063] Figure 5 illustrates an embodiment of CGH designed to enhance its sensitivity in detecting small changes in 
copy number of various sequences. When a CGH procedure as indicated in Figure 4 is followed, intrinsic variation in 
the saturation levels, or rate of signal build-up at different positions in the reference genome may not be indicative of 
abnormal gain or loss of sequences. Such intrinsic variations would interfere with interpretation of intensity differences 

30 as indicating differences in copy number of the sequences. This CGH embodiment overcomes that potential problem 
by providing a mixture of labeled subject nucleic acid, in this case tumor DNA labeled with a green fluorochrome, and 
a differently labeled competitor nucleic acid in this case normal human genomic DNA labeled with a red fluorochrome. 
The two differently labeled DNAs are simultaneously hybridized to the chromosome spread. [Once again, removal of 
the repetitive sequences and/or blocking of the signal therefrom is performed but not illustrated.) Changes in the ratio 

35 of green to red along each of the chromosomes in the reference spread then indicate regions of increased or decreased 
sequence copy number in the tumor. Those ratio changes may result in color variations from red to yellow to green on 
the reference spread. 

[0064] Figure 6 graphically and schematically explains the kinetics underlying the CGH embodiment illustrated in 
Figure 5. In the center is one of the chromosomes of the reference chromosome spread, a normal human chromosome 
40 in this case. The darkness of the shading on the reference chromosome shows the ratio of green to red intensity along 
the chromosome. 

[0065] In the amplified region, the green/red ratio is much higher than in the normal region, whereas in the deleted 
region the green/red ratio is less than in the normal region. The arrows from examples of each of the different green/ 
red intensity regions point to kinetic curves that indicate the build-up of green (solid line for the tumor DNA) and red 
45 (dashed line for the normal DNA) signals during the hybridization. In the normal region, upper left graph, the red and 
green signals build together. (They have been normalized to be equal for the purposes of this explanation.) In the 
amplified region, upper right, the green (tumor) signal builds up much more rapidly than the red (normal) signal, the 
green/red ratio being approximately the level of amplification (given the normalization to the normal part of the chro- 
mosome). 

so [0066] In the lower left of Figure 6, the signal build-up for the duplicated region is shown; the green (tumor) signal 
is 50% brighter than the red (normal) signal. In the lower right, the build-up for a deleted region is schematically de- 
scribed; the green (tumor) signal is 50% dimmer than the red (normal) signal. The ratio approach of this CGH embod- 
iment further normalizes for the frequent finding that hybridization to some chromosomes in a. spread is intrinsically 
brighter than that for others because of differences in the local hybridization environment. 

55 [0067] Figure 7 graphically illustrates the correlation of the number of X chromosomes in five fibroblast celt lines and 
the average green-to-red ratio of the X chromosome(s) relative to the same ratio for the autosomes. 
[0068] Figure 8 illustrates green-to-red fluorescence ratio profiles of chromosomes 1.9,11,16 and 1 7 after compar- 
ative genomic hybridization with breast cancer cell line 600PE (green) and with a normal DNA (red). The profiles reflect 
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the relative copy number of the chromosomal regions. Fluorescence in situ hybridization (FISH) with 16p and 16q 
cosmid probes to interphase and metaphase 600PE cells indicated that there were two signals with 1 6p cosmid probes 
and one signal from the 16q cosmid probes. That information on the absolute copy number of those loci provided by 
FISH permits interpretation of the ratio 1 .0 as indicating that there are two copies of the sequence throughout the 
genome. 

[0069] The dip in the profile at 1p34 through 1p36 may represent a previously unsuspected small interstitial deletion; 
however, that observation has not yet been independently verified with specific probes for that region. 
[0070] Centromeric and heterochromatic regions of the genome are not included in the analysis because the Cot-1 
DN A partially blocks signals in those regions, and the large copy number polymorphisms between individual sequences 
at those loci effect unreliable ratio data. 

[0071] Figure 9A and 9B respectively provide green-to-red fluorescence ratio profiles of chromosome 8 (Figure 9A) 
and chromosome 2 (Figure 9B) after comparative genomic hybridization respectively with COLO 320 HSR (human 
colon adenocarcinoma cell line) and NCI H69 (small cell lung carcinoma cell line) cell line DNAs (green) and with 
normal human DNA (red). 

[0072] In Figure 9A, the myc locus at 8q24 shows a highly elevated green-to-red ratio, which is consistent with the 
known high level amplification of myc in the COLO 320HSR cell line. 

[0073] In Figure 9B, three regions of amplification are seen on chromosome 2. The signal at 2p24 corresponds to 
the location of N-myc known to be amplified in the NCI-H69 ceil line. The two other regions with a highly increased 
green-to-red fluorescence ratio, at 2p21 and 2q21 , were not previously known to be amplified in the NCI-H69 cell line. 

DETAILED DESCRIPTION 

[0074] Comparative Genomic Hybridization (CGH) has also been termed Copy Ratio Reverse Cytogenetics (CRRC), 
competition hybridization and quantitative in situ ratio karyotyping (QUIRK). Further, in the embodiment wherein fluor- 
ochromes are used as labels : it has been termed competition FISH (fluorescence in situ hybridization). CGH specifically 
provides methods whereby amplifications, duplications and/or deletions can be identified in an immediate overview of 
a genome. 

[0075] CGH provides methods for determining variations in the copy number of different elements in a mixture of 
nucleic acid sequences (for example, genomic DNA isolated from a tumor) as a function of the location of those se- 
quences in the genome of a reference organism (for example, the genome of a normal cell from the same species). 
The methods comprise the use of in situ hybridization of the nucleic acid sequence mixture to a chromosome spread 
of the reference organism, and measuring the intensity of the hybridization at different locations along the target chro- 
mosomes. Exemplary methods are schematically outlined in Figures 1 -6. Those illustrative examples are not exhaustive 
but suggest the wide range of variations and other uses of the basic approach. 

[0076] As the figure descriptions indicate, it is critical that signals from repetitive sequences do not dominate the 
signal from the subject nucleic acid pool, and that they be removed from the pool or that their signals be suppressed 
as necessary. It is preferred to exclude sequences from the hybridization or block sequences in the hybridization mixture 
that could bind to multiple clearly separated positions on the chromosomes, for example, sites that are on different 
chromosomes, or that are on the same chromosome but are well-separated. In many applications of CGH : it is the 
high copy repetitive sequences, such as Alu, Kpn, Lines, and alpha-satellites among others, that are removed from 
the labeled subject nucleic acid and/or which are blocked and/or the binding sites therefor are blocked. Described 
herein are methods to remove and/or block those repetitive signals. It should be noted that nucleic acid sequences in 
the labeled nucleic acid that bind to single copy loci are substantially retained in the hybridization mixture of labeled 
subject nucleic acids, and such single copy sequences as well as their binding sites in the reference chromosome 
spread remain substantially unblocked relative to the repetitive sequences that bind to multiple loci (that is. loci that 
are visually distinguishable) both before and during the hybridization. 

[0077] The methods of this invention provide the means to identify previously unknown regions of amplification and 
deletion. For example, one embodiment of CGH as detailed in Example 1 herein provides an efficient method that 
gives an immediate overview of a genome identifying all regions that are amplified greater than about five-fold to ten- 
fold as well as at least large deletions. More sensitive embodiments that can identify smaller amplifications and deletions 
are also disclosed. 

[0078] Nanogram quantities of the subject nucleic acids are required for the CGH methods of this invention. Paraffin 
embedded tumor sections can be used as well as fresh or frozen material. Snap frozen material from normal and 
malignant tissue are preferred (or mRNA isolation. 

[0079] Standard procedures can be used to isolate the required nucleic acid from the subject cells. However, if the 
nucleic acid, for example, DNA or mRNA, is to be extracted from a low number of cells (as from a particular tumor 
subregion) or from a single cell, it is necesary to amplify that nucleic acid, by a polymerase chain reaction (PCR) 
procedure or by a non-polymerase chain reaction (non-PCR) procedure. PCR and preferred PCR procedures are 
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described infra . Exemplary non-PCR procedures include the ligase chain reaction (LCR) and linear amplification by 
use of appropriate primers and their extension (random priming). 

[0080] Some of the various embodiments of CGH are illustrated, particularly in Figures 1 -6. In the embodiment illus- 
trated in Figures 5 and 6, wherein a subject nucleic acid, in this case, human genomic DNA. that is labeled differently 
from another subject nucleic acid, amplifications and/or deletions are indicated by a change in ratio between the different 
signals, rather than just a change in signal intensity. 

[0081] The representative examples, concerning CGH of Examples t, 2 and 3 below involve the hybridizations of 
tumor cell line DNA to normal human metaphase spreads. However, there are many permutations and combinations 
of pairwise and multiple hybridizations of different nucleic acids from different genomes all of which are considered to 
be within the scope of this invention. 

[0082] For example, labeled tumor cell line DNA and differently labeled human genomic DNA could be simultaneously 
hybridized to a metaphase spread of a tumor cell line metaphase spread. Further, DNA from a primary tumor and that 
from its metastasis could be differently labeled and hybridized in a CGH method to a normal human metaphase or to 
a related tumor cell line metaphase. Those are just some of the many examples of CGH. 

[0083] Although the examples herein concern the hybridizations of the DNA from breast cancer cell lines and primary 
tumors to normal human metaphase spreads, it will be clear to anyone skilled in the art that CGH is not limited to 
studying genomes of cancer cells or to the results of hybridizing abnormal genomes to normal genomes. CGH permits 
the comparison of nucleic acid sequence copy frequencies of any two or more genomes, even genomes of different 
species if their nucleic acid sequences are sufficiently complementary to allow for meaningful interpretation. It should 
be noted regarding interspecies comparisons that the information obtained by CGH includes not only an assessment 
of relative copy number but also that of sequence divergence. 

[0084] It will also be clear tathose skilled in the art that hybridization with nucleic acid other than chromosomal DNA, 
such as messenger RNA (mRNA) or complementary DNA (cDNA) of subject cells can be used to determine the location 
and level of expression of genes in those cells. Conventional methodology is used to extract mRNA from a cell or cell 
population, and to synthesize in vitro cDNA by reverse transcription. 

[0085] CGH does not require the preparation of condensed chromosomes, for example, metaphase, prophase or 
other condensed chromosomal states : of the subject genomes. Thus, genomes from which metaphase, prophase or 
otherwise condensed chromosomal spreads are difficult, time-consuming or not possible to prepare at least in good 
quality, for example, genomes of tumor cells or fetal cells can be studied by CGH. 

[0086] In CGH, labeled subject nucleic acids, for example, labeled tumor DNA, is hybridized to a reference genome, 
for example, a normal human metaphase spread, under conditions in which the signal from amplified, duplicated and/ 
or deleted nucleic acid sequences from the labeled nucleic acid can be visualized with good contrast. Such visualization 
is accomplished by suppressing the hybridization of repetitive sequences that bind to multiple loci including the high 
copy interspersed and clustered repetitive sequences, such as, Alu, Kpn : Lines, alpha-satellites among others, using 
unlabeled total human genomic nucleic acid, preferably DNA, and/or the repeat-enriched (Cot-1) fraction of genomic 
DNA, and/or by removing such repetitive sequences from the hybridization mixture. In providing the detection sensitivity 
required, the extent of suppression of the hybridization of repetitive sequences and/or removal thereof can be adjusted 
to the extent necessary to provide adequate contrast to detect the differences in copy number being sought; for example, 
subtler copy number changes may require the suppression or removal of lower level repetitive sequences. 
[0087] The relative concentrations and/or labeling densities of the differently labeled nucleic acids in a hybridization 
mixture may be adjusted for various purposes. For example, when using visual observation or photography of the 
results, the individual color intensities need to be adjusted for optimum observability of changes in their relative inten- 
sities. Adjustments can also be made by selecting appropriate detection reagents (avidin, antibodies and the like), or 
by the design of the microscope fitters among other parameters. When using quantitative image analysis, mathematical 
normalization can be used to compensate for general differences in the staining intensities of different colors. 
[0088] The kinetics of the CGH hybridizations are complicated. Since the subject nucleic acids are frequently double 
stranded, complementary sequences will reassociate in the hybridization mix as well as hybridizing to the target. Such 
reassociation may result in a more rapid decrease in concentration of the high copy sequences than the low copy ones, 
thereby making the signal intensity variations on the reference chromosomes less pronounced than the copy differences 
in the original subject DNAs. In addition, nonspecific binding of the labeled subject DNAs to the slide, coverslip, etc. 
may generally reduce the concentration of that labeled subject nucleic acid during the hybridization. Those skilled in 
the art will recognize numerous methods of optimizing the quantitative aspects of CGH : such as : mathematical cor- 
rection of digital images, supplying freshly denatured subject DNA during the hybridization, and adding unlabeled 
genomic DNA in excess to dominate the reassociation rates. The term "saturation" is defined in the context of hybrid- 
ization kinetics. 

[0089] The resolution of CGH is presently at a level that can be seen through a light microscope, as is traditional 
cytogenetic staining. Thus, if a small sequence in a subject nucleic acid is amplified, to be seen as a signal in a subject 
genome, it must be amplified enough times for its signal to be able to be visualized under a light microscope. For 
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example, the locus for erbB-2 which is relatively small (very approximately, a few hundred kb), needs to be amplified 
at least greater than five times to be visually distinguishable under a light microscope when the CGH embodiment used 
in Example 1 is employed. On the other hand, if a large section of a chromosome is present at increased frequency in 
a subject nucleic acid, the signal from that region would show up in the reference genome at a much lower level of 
5 amplification. 

[0090] The term "labeled" is herein used to indicate that there is some method to visualize nucleic acid fragments 
that are bound to the target, whether or not the fragments directly carry some modified constituent. A section infra 
entitled "Labeling the Nucleic Acid Fragments of the Subject Nucleic Acids" describes various means of directly labeling 
the probe and other labeling means by which the bound probe can be detected. 

10 [0091] The phrase "antenna cell line" is herein used to indicate a reference genome that has one or more known 
significant genetic aberrations, for example, a cell line known to have an oncogene that is highly amplified, for example, 
in large homogeneously staining regions (HSRs). The amplified regions of that cell line would thus provide a much 
bigger target site than a normal chromosome spread. Thus, observation of the signal from such a large target site 
would be easier in that on average the signal would be brighter from amplified target sequences in the reference 

15 genome as provided by such an antenna cell line. Subject nucleic acids extracted from, for example, a number of tumor 
cells, could be tested by a CGH hybridization to such an antenna cell line to see tf it also contained amplification(s) of 
the oncogene known to be amplified in the cell line. 

[0092] When an antenna cell line is used as the reference genome, there are instances wherein it can be used in 
interphase rather than as a chromosome spread. For example , if one is checking to see if a certain oncogene is amplified 
20 or not in the subject nucleic acid, interphase CGH is sufficient. However, the maximum amount of information is provided 
when condensed chromosome spreads are used. 

[0093] A base sequence at any point in the genome can be classified as either "single-copy" or "repetitive". For 
practical purposes the sequence needs to be long enough so that a complementary probe sequence can form a stable 
hybrid with the target sequence under the hybridization conditions being used. Such a length is typically in the range 

25 of several tens to hundreds of nucleotides. 

[0094] A "single-copy sequence" is that wherein only one copy of the target nucleic acid sequence is present in the 
haploid genome. "Single-copy sequences" are also known in the art as "unique sequences". A probe complementary 
to a single-copy sequence has one binding site in hapioid genome. A "repetitive sequence" is that wherein there is 
more than one copy of the same target nucleic acid sequence in the genome. Each copy of a repetitive sequence need 

30 not be identical to all the others. The important feature is that the sequence be sufficiently similar to the other members 
of the family of repetitive sequences such that under the hybridization conditions being used, the same fragment of 
probe nucleic acid is capable of forming stable hybrids with each copy. 

[0095] Herein; the terms repetitive sequences, repeated sequences and repeats are used interchangeably 
[0096] The phrase "metaphase chromosomes" in herein defined to encompass the concept of "condensed chromo- 

35 somes" and is defined to mean not only chromosomes condensed in the prophase or metaphase stage of mitosis but 
any condensed chromosomes, for example, those condensed by premature chromosome condensation or at any stage 
in the cell cycle wherein the chromosome can be visualized as an individual entity. It is preferred that the chromosomes 
in the reference genome be as long as possible but condensed sufficiently to be visualized individually. 
[0097] A subject nucleic acid is herein considered to be the same as another nucleic acid if it is from a member of 

^0 the same sex of the same species and has no significant cytogenetic differences from the other nucleic acid. For 
example, the DNA extracted from normal lymphocytes of a human female is considered for the purposes of this invention 
to be the same nucleic acid as that of DNA from normal cells of a human female placenta. 
[0098] The following abbreviations are used herein: 

45 Abbreviations 

[0099] 

AAF - N-acetoxy-N-2-acetyl-aminof)uorene 

50 

ATCC - American Type Culture Collection 

BN - bicarbonate buffer with NP-40 

55 Brd/ Urd - bromodeoxyuridine 



BRL - 



Bethesda Research Laboratories 
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bp- 


base pair 


CCD- 


charge coupled device 


CGH - 


Comparative Genomic Hybridization 


Chr. - 


chromosomal 


CML - 


chronic myelogenous leukemia 


CRRC- 


Copy Ratio Reverse Cytogenetics 


DAPI- 


4,6-diamidino-2-phenylindole 


dATP- 


deoxyadenosine triphosphate 


DCS- 


as in fluorescein-avidin DCS (a commercially available cell sorter grade of fluorescein Avidin D) 


dCTP- 


deoxycytosine triphosphate 


dGTP * 


deoxyguanosine triphosphate 


Dl- 


DNA index 


DM - 


double minute chromosome 


dNTP - 


deoxynucleotide triphosphate 


dTTP - 


deoxythymidine triphosphate 


dUTP- 


deoxyuridine triphosphate 


EDTA - 


ethylenediaminetetraacetate 


E/P- 


estrogen/progesterone 


FISH- 


fluorescence in situ hybridization 


FACS - 


fluorescence-activated cell sorting 


FITC - 


fluorescein isothiocyanate 


HPLC- 


high performance liquid chromatography 


HSR - 


homogeneously staining region 


ISCN- 


International System for Cytogenetic Nomenclature 


IB- 


isolation buffer 


kb- 


kilobase 


kDa- 


kilodalton 


LCR - 


ligase chain reaction 


LOH - 


loss of heterozygosity 
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Mb - megabase 
met. - metastasis 
min - minute 
ml - milliliter 
mM - milliMole 
mm - millimeter 
ng - nanogram 

NIGMS - National Institute of General Medical Sciences 

NP-40 - non-ionic detergent commercially available from Sigma as Nonidet P-40 (St. Louis, MO) 

PBS - phosphate-buffered saline 

PCR - polymerase chain reaction 

PHA - phytohemagglutinin 

PI - propidium iodide 

PI. - pleural 

PMSF - phenylmethylsulfonyl fluoride 

PN buffer - mixture of 0.1 M NaH 2 P0 4 and 0.1 M Na 2 HP0 4 , pH 8; 0.1% NP-40 

PNM buffer - Pn buffer plus 5% nonfat dry milk (centrifuged); 0.02% Na azide 

QUIPS - quantitative image processing system 

QUIRK - quantitative in situ ratio karyotyping 

Rb-1 - retinoblastoma tumor suppressor gene 

RFLP - restriction fragment length polymorphism 

RPM - revolutions per minute 

SD - Standard Deviation 

SDS - sodium dodecyl sulfate 

SSC - 0.15 M NaCI/0.015 M Na citrate, pH 7 

Td - doubling time 

ug - microgram 

ul - microliter 

um - micrometer 
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uM - micromole 

VNTR - variable number tandem repeat 

5 [0100] Resolution of differences in copy number can be improved by the use of image analysis and by averaging 
the results from hybridizations of the subject nucleic acids to multiple condensed chromosome spreads. Using such 
methods, the background signal (noise) can be differentiated from actual nucleic acid sequence copy number differ- 
ences. 

10 Image Analysis: 

[0101] An image analysis system, preferably computer-assisted, can be used to enhance and/or accurately quanti- 
tate the intensity differences between and/or among the signals from a hybridization and the background staining 
differences for more accurate and easier interpretation of results. Image analysis and methods to measure intensity 
15 are described, for example, in Hiraoka et at., Science, 238: 36-41 (1 987) and Aikens et al., Meth. Cell Biol., 29: 291 -31 3 
(1989). In such an image analysis system, it is preferred to use a high quality CCD camera whose intensity response 
is known to be linear over a wide range of intensities. 

[0102] The components of a particular quantitative image processing system (QUIPS) are described in Example 1 
under the subheading Fluorescence Microscopy and Interpretation of Results. As exemplified in Example 1 , a com- 

20 puter-assisted image analysis system with a fitterwheel is used so that the images from the signals and counterstaintng 
of the DNA are superimposed on one image. Pseudocolors, that is, colors that are not exactly spectrally converted, 
can be displayed. Contrast stretching, wherein the differences between the intensity levels of the signals and back- 
ground staining differences are enhanced by adjusting controls of the image analysis system. Thresholding can also 
be used wherein the background staining can be assigned a value close to zero so it would barely appear in the 

25 processed image from such a system. Similarly, computer analysis permits substraction of background, smoothing of 
fluctuations in the signals, accurate intensity and ratio calculations and the ability to average signals on chromosomes 
in multiple spreads. • ■ " 

Absolute Copy Numbers: 

30 

[01 03] Hybridization of the subject DNAs to the reference chromosomes gives information on relative copy numbers 
of sequences. Some additional normalization is required to obtain absolute copy number information. One convenient 
method to do this is to hybridize a probe, for example a cosmid specific to some single locus in the normal haploid 
genome, to the interphase nuclei of the subject cell or cell population(s) (or those of an equivalent cell or representative 

35 cells therefrom, respectively). Counting the hybridization signals in a representative population of such nuclei gives 
the absolute sequence copy number at that location. Given that information at one locus, the intensity (ratio) information 
from the hybridization of the subject DNA(s) to the reference condensed chromosomes gives the absolute copy number 
over the rest of the genome. In practice, use of more than one reference locus may be desirable. In this case, the best 
fit of the intensity (ratio) data through the reference loci would give a more accurate determination of absolute sequence 

40 copy number over the rest of the genome. 

[0104] Thus, the CGH methods of this invention combined with other well-known methods in the art can provide 
information on the absolute copy numbers of substantially all RNA or DNA sequences in subject cell(s) or cell population 
(s) as a function of the location of those sequences in a reference genome. For example, one or more chromosome- 
specific repeat sequence or high complexity painting probes can be hybridized independently to the interphase nuclei 

45 of cells representative of the genomic constitution of the subject cell(s) or cell population(s). Whole chromosome paint- 
ing probes are now available for all the human chromosomes [Collins et al., Genomics, 11 : 997-1 006 (1 991 )]. Specific 
repeat-sequence probes are also available [Trask et aL Hum. Genet., 78 : 251 (1988) and references cited therein; 
and commercially available from Oncor (Gaithersburg, MD, USA)]. Hybridization with one or more of such probes 
indicates the absolute copy numbers of the sequences to which the probes bind. 

50 [0105] For such interphase analysis, painting probes with a complexity of from about 35 kb to about 200 kb, are 
preferred; probes from about 35 kb to about 100 kb are further preferred; and still more preferred are probes having 
a complexity of from about 35 kb to 40 kb, for example, a cosmid probe. Exemplary of such locus-specific painting 
probes are any cosmid, yeast artificial chromosomes (YACs), bacterial artificial chromosomes (BACs), and/or p1 phage 
probes as appropriate, preferably to the arms of a selected chromosome. Such cosmid probes, for example, are com- 

55 mercialty available from Clontech [South San Francisco, CA (USA)] which supplies cosmid libraries for all the human 
chromosomes. Another example of a cosmid probe that could be used in -such methods of this invention would be a 
3p cosmid probe called CC13-787 obtained from Yusuke Nakamura, M.D., Ph.D. [Division of Biochemistry, Cancer 
Institute, Toshima, Tokyo, 170, Japan]. Its isolation and mapping to 3p21.2-p21.1 is described in Yamakawa et al. r 
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Genomics , 9(3): 536-543 (1991). Another example would be a 3q cosmid probe named J14R1A1 2 obtained from Wen- 
Lin Kuo [Biomedical Department, P.O. Box 5507 (L-452), Lawrence Livermore National Laboratory Livermore, CA 
94550 (USA)). For interphase analysis, preferred repeat sequence probes are centromeric-specific and/or peri-cen- 
tromeric-specific repeat sequence probes. Such a centromeric-probe is, for example, the chromosome 1 7 peri-centro- 

5 meric repeat probe (cosmid ck17.10) and the alpha satellite repeat probe for the centromeric region of chromosome 
8, both of which are described in Example 1 infra. A variety of repeat sequence probes are commercially available 
from Oncor [Gaithersburg, MD (USA)]. However the locus-specific painting probes are preferred over the repeat se- 
quence probes for the methods of this invention to determine absolute copy numbers of nucleic acid sequences. 
[0106] Further, when the subject nucleic acid sequences are DNA, the reference copy numbers can be determined 

10 by Southern analysis. When the subject nucleic acid sequences are RNA, the reference copy numbers can be deter- 
mined by Northern analysis. 

[0107] Those reference copy numbers or reference frequencies provide a standard by which substantially all the 
RNA or DNA sequences in the subject cells or cell populations can be determined. CGH methods are used to determine 
the relative copy numbers of the rest of the sequences. However absolute copy numbers require a standard against 
15 which the results of CGH can be determined. Otherwise the CGH procedures would have to be highly standardized 
and quantitated to see differences in the absolute copy numbers of sequences in a genome, for example, haploidy, 
triploidy, octaploidy, wherein there are 1, 3 and 8 copies of each of the chromosomes, respectively. 

PCR and Microdissection : 

20 

[0108] The mechanics of PCR are explained in Saiki et al., Science, 230 : 1350 (1985) and U.S. Patent Nos. 
4,683,195, 4,683,202 (both issued July 18, 1987) and 4,800,159 (issued January 24, 1989).] PCR offers a rapid, sen- 
sitive and versatile cell-free molecular cloning system in which only minute amounts of starting material are required. 
[0109] A preferred PCR method to amplify the subject nucleic acids for testing by CGH is a PCR adapter-linker 

25 amplification [Saunders et al., Nuc. Acids Res., 17 9027 (1 990); Johnson, Genomics, 6 : 243 (1 990) and PCT 90/00434 
(published August 9, 1 990)]. The labeled subject nucleic acid could be produced by such a adapter-linker PCR method 
from a few hundred cells; for example, wherein the subject nucleic acid is tumor DNA, the source DNA could be a few 
hundred tumor cells. Such a method could provide a means to analyse by CGH clonal sub-populations in a tumor. 
[0110] Another further preferred PCR method is a method employing a mixture of primers described in Meltzer et 

30 al., "Rapid Generation of Region Specific Probes by Chromosome Microdissection and their Application: A Novel Ap- 
proach to Identify Cryptic Chromosomal Rearrangements," Nature- Genetics, 1 (1): 24-28 (April 1 992). Microdissection 
of sites in the reference metaphase spread that produce signals of interest in CGH, would permit PCR amplification 
of nucleic acid sequences bound at such sites. The amplified nucleic acid could then be easily recovered and used to 
probe available libraries, as for example, cosmid libraries, so that the amplified sequences could be more rapidly 

35 identified. 

[0111] High copy repetitive sequences can be suppressed in amplifying the subject nucleic acid by PCR. The PCR 
primers used for such a procedure are complementary to the ends of the repetitive sequences. Thus, upon proper 
orientation, amplification of the sequences flanked by the repeats occurs. One can further suppress production of 
repetitive sequences in such a PCR procedure by first hybridizing complementary sequences to said repetitive se- 

40 quences wherein said complementary sequences have extended non-complementary flanking ends or are terminated 
in nucleotides which do not permit extension by the polymerase. The non-complementary ends of the blocking se- 
quences prevent the blocking sequences from acting as a PCR primer during the PCR process. Primers directed 
against the Alu and U repetitive DNA families have allowed the selective amplification of human sequences by inter- 
spersed repetitive sequence PCR (IRS-PCR) [Nelson et al., PNAS, 86: 6686 (1989); Ledbetter et al., Genomics, 6: 

45 475 (1990)]. ' ~ 

Archived Material 

[0112] An important aspect of this invention is that nucleic acids from archived tissue specimens, for example ! par- 
50 aff in-embedded or formal in -fixed pathology specimens, can be tested by the methods of CGH . Said nucleic acid cannot, 
of course, be prepared into chromosome spreads for traditional cytogenetic chemical staining. Also, it is difficult for 
large enough restriction fragments to be extracted from such material for other conventional research tools, such as 
Southern analysis. However, the nucleic acid from such specimens can be extracted by known techniques such as 
those described in Greer et al., Anatomic Pathology, 95(2): 117-124 (1991) and Dubeau et al., Cancer Res., 46: 
55 2964-2969 (1 986), and if necessary, amplified for testing by various CGH methods. Such nucleic acid can be amplified 
by using a polymerase chain reaction (PCR) procedure (described above), for example, by the method described in 
Greer et aL supra wherein DNA from paraffin-embedded tissues is amplified by PCR. 

[01 13] A particular value of testing such archived nucleic acid is that such specimens are usually keyed to the medical 
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records of the patients from whom the specimens were taken. Therefore, valuable diagnostic/prognostic associations 
can be made between the revealed cytogenetic state of patients' nucleic acid material and the medical histories of 
treatment and outcome for those patients. For example, information gathered by CGH can be used to predict the 
invasiveness of a tumor based upon its amplification and/or deletion pattern matched to associations made with similar 

5 patterns of patients whose outcomes are known. 

[0114] Analogously, other nucleic acid that is fixed by some method, as, for example, archaelogical material pre- 
served through natural fixation processes, can also be studied by CGH procedures. As indicated above, copy number 
differences between species provide information on the degree of similarity and divergence of the species studied. 
Evolutionarily important linkages and disjunctions between and among species, extant or extinct, can be made by 

10 using the methods of CGH. 

Tumor Cytogenetics 

[01 15] CGH provides the means to assess the association between gene amplification and/or deletion and the extent 
* 5 of tumor evolution. Correlation between amplification and/or deletion and stage or grade of a cancer may be prognos- 
tically important because such information may contribute to the definition of a genetically based tumor grade that 
would better predict the future course of disease with more advanced tumors having the worst prognosis. In addition, 
information about early amplification and/or deletion events may be useful in associating those events as predictors 
of subsequent disease progression. Gene amplification and deletions as defined by CGH to, for example, normal 
20 metaphase spreads (genomic site, intensity of the signal and/or differences in signal ratios, and number of different 
genomic sites at which the copy number differences occur) can be associated with other known parameters such as 
tumor grade, histology, Brd/Utd labeling index, hormonal status, nodal involvement, tumor size, survival duration and 
other tumor properties available from epidemiological and biostatistical studies. For example, tumor DNA to be tested 
by CGH could include atypical hyperplasia, ductal carcinoma in situ, stage l-lll cancer and metastatic lymph nodes in 
25 order to permit the identification of associations between amplifications and deletions and stage. 

[0116] The associations made may make possible effective therapeutic intervention. For example, consistently am- 
plified regions may contain an overexpressed gene, the product of which may be able to be attacked therapeutically 
(for example, the growth factor receptor tyrosine kinase. p185 HER2) 

[0117] CGH hybridizations of nucleic acids from cells of primary cancers that have metastasized to other sites can 
30 be used to identify amplification and/or deletion events that are associated with drug resistance. For example, the 
subject nucleic acids to be analysed could be selected so that approximately half are from patients whose metastatic 
disease responded to chemotherapy and half from patients whose tumors did not respond. If gene amplification and/ 
or deletion is a manifestation of karyotypic instability that allows rapid development of drug resistance, more amplifi- 
cation and/or deletion in primary. tumors from chemoresistant patients than in tumors in chemosensitive patients would 
35 be expected. For example, if amplification of specific genes is responsible for the development of drug resistance, 
regions surrounding those genes would be expected to be amplified consistently in tumor cells from pleural effusions 
of chemoresistant patients but not in the primary tumors. Discovery of associations between gene amplification and/ 
or deletion and the development of drug resistance may allow the identification of patients that will or will not benefit 
from adjuvant therapy. 

40 [01 1 8] Once a new region of amplification or deletion has been discovered by CGH, it can be studied in more detail 
using chromosome-specific painting [Pinkel et al., PNAS (USA), 85 : 9138-9142 (1988); EP Publication No. 430 : 402 
(June 5, 1 991)] with a collection of probes that span the amplified or deleted region. Probes to amplified regions will 
show more signals than centromeric signals from the same chromosome, whereas probes to nonamplified regions will 
show approximately the same number of test and centromeric signals. For example, the amplified regions on 1 7q22-23 

45 and 20qter (discussed as newly discovered regions of amplification in Example 1 ) show variability in size from tumor 
to tumor using CGH (the 1 7q22-23 region more markedly); it can be expected that the region containing the important 
gene(s) can be narrowed by mapping the regions of amplification in multiple tumors in more detail to find the portion 
that is amplified in all cases. Probes for those studies can be selected, for example from specific cosmid libraries 
produced by the National Laboratory Gene Library Project and/or from the National Institute of Health (NIH) genomic 

so research projects. 

[0119] The c-erbB-2 oncogene, also referred to as HER-2 orneu, encodes for a 185 kilodalton (Kd) protein. Studies 
have reported c-erbB-2 gene amplification in human mammary tumor cell lines. [Kraus et al., EMBO J. 6 : 605-610 
(1987); van de Vijver et al., Mol. Cell Biol., 7 : 2019-2023 (1987).] Also, c-erbB-2 gene amplification in human breast 
cancer has been shown to be associated with disease behavior, and may be a predictor of clinical outcome. [Slamon 
55 et al., Science, 235 : 177-182 (1987); Berger et al., Cancer Res., 48: 1238-1243 (1988); Zhou et al., Cancer Res. : 47 : 
6123-6125 (1987); and Venter et al., Lancet, 11 : 69-71 (1987)]. C-erbB-2 has also been shown to be amplified in 
ovarian cancers. [Alitalo and Schwab, Advances in Cancer Res., 47 : 235-281 (1986).] 

[0120] C-myc is a proto-oncogene which is the cellular homolog of the transforming gene of the chicken retrovirus 
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MC29. In humans, c-myc lies on the long arm of chromosome 8, at band 124, and spans about 5 kilobase pairs. The 
myc protein is a phosphoprotein present in the nucleus. The normal function of c-myc is unknown; however, it also 
certainly plays a role in cell division, and is expressed in normally growing cells as well as in tumor cells. It is now 
widely believed that translocations involving c-myc lead to altered transcription of the gene, contributing to malignant 
5 transformation. 

[0121] Sequences from N- myc member of the myc gene family, have been shown to be amplified as much as a 
thousandfold in some neuroblastomas. N- myc amplifications are usually seen in the later stage HI and IV tumors. Some 
small-cell lung carcinomas also have amplified myc genes in double minute chromosomes (DMs) and homogeneously 
staining regions (HSRs). Myc has also been shown to be amplified in colon cancer. [Alitato and Schwab, supra .] Again 
10 such amplifications are found in late stages of tumor development, in the so-calted variant cells that exhibit a more 
malignant behavior. Amplifications can involve either c-myc : N-myc or another member of the myc gene family, L- myc . 
[Watson et al. t supra at pp. 1084-1086]. 

[0122] In addition, overexpression has been observed for the p-glycoprotein gene family associated with multi-drug 
resistance and for drug metabolizing enzymes such as P450 containing enzymes and glutathione S-transf erase. [Fair- 
's child and Cowan, J. Radiation Oncol. Biol. Phys., 20: 361-367 (1990).] 

[01 23] Identification of amplified and/or deleted genes is important to the management of cancer, for example, breast 
cancer, for several reasons: 

1) to improve prognostication; 

20 2) to detect amplification and/or deletion events that are associated with the development of drug resistance; and 

3) to improve therapy. 

For example, in regard to improving prognostication, in breast cancer the amplification of oncogenes, such as int-2, 
erbB-2 and myc occur frequently and have been associated with aggressive growth and poor prognosis in some studies. 

25 [Schwab and Amier, Genes, Chromosomes & Cancer. 1: 181-193 (1990).] In regard to reason (2), gene amplification 
has clearly been shown to lead to drug resistance in vitro (for example, amplification of the dihydrofolate reductase 
gene confers resistance to methotrexate), and is likely to occur in patients undergoing therapy as well (for example, 
as a result of over expression of glutathione S-transferase and p-giycoprotein). [Fairchild and Cowan, supra ]. Thus, 
the identification of resistance-linked genes would have a major impact on therapy by allowing therapy modification 

30 as resistance-related gene amplification occurs. Therapy could be improved by targeting for specific therapy, tumors 
that overexpress specific amplified genes. 

Prenatal Diagnosis 

35 [0124] Prenatal screening for disease-linked chromosome aberrations (e.g., trisomy 21) is enhanced by the rapid 
detection of such abberrations by the methods and compositions of this invention. CGH analysis is particularly signif- 
icant for prenatal diagnosis in that it yields more rapid results than are available by cell culture methods. 

Removal of Repetitive Sequences and/or Disabling the Hybridization Capacity of Repetitive Sequences 

40 

[0125] The following methods can be used to remove repetitive sequences and/or disable the hybridization capacity 
of such repetitive sequences. Such methods are representative and are expressed schematically in terms of procedures 
well known to those of ordinary skill the art, and which can be modified and extended according to parameters and 
procedures well known to those in the art. 

45 [0126] Bulk Procedures. In many genomes, such as the human genome, a major portion of distributed (or shared) 
repetitive DNA is contained in a few families of highly repeated sequences such as Alu. These methods primarily exploit 
the fact that the hybridization rate of complementary nucleic acid strands increases as their concentration increases. 
Thus, if a mixture of nucleic acid fragments is denatured and incubated under conditions that permit hybridization, the 
sequences present at high concentration will become double-stranded more rapidly than the others. The double-strand- 

50 ed nucleic acid can then be removed and the remainder used in the hybridizations. Alternatively, the partially hybridized 
mixture can be used as the subject nucleic acid, the double-stranded sequences being unable to bind to the target. 
The following are methods representative of bulk procedures that are useful for disabling the hybridization capacity of 
repetitive sequences or removing those sequences from a mixture. 

[0127] Self-reassociation . Double-stranded nucleic acid in the hybridization mixture is denatured and then incubated 
55 under hybridization conditions for a time sufficient for the high-copy sequences in the mixture to become substantially 
double-stranded. The hybridization mixture is then applied to the reference chromosome spread. The remaining labeled 
single-stranded copies of the highly repeated sequences may bind throughout the reference chromosome spread 
producing a weak, widely distributed signal. . 
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[01 28] Use of blocking nucleic acid. Unlabeled nucleic acid sequences which are complementary to those sequences 
in the hybridization mixture whose hybridization capacity it is desired to inhibit are added to the hybridization mixture. 
The subject nucleic acids and blocking nucleic acid are denatured, if necessary and incubated under appropriate 
hybridization conditions. The sequences to be blocked become double-stranded more rapidly than the others, and 

5 therefore are unable to bind to the reference spread when the hybridization mixture is applied to the spread. In some 
cases, the blocking reaction occurs so quickly that the incubation period can be very short, and adequate results can 
be obtained if the hybridization mix is applied to the spread immediately after denaturation. Further, the probe and the 
target can be simultaneously denatured in some cases. A blocking method is generally described in the context of 
Southern analysis by Sealy et aL, "Removal of Repeat Sequences form Hybridization Probes". Nucleic Acid Research , 

10 13:1905 (1 985). Examples of blocking nucleic acids include genomic DNA, a high-copy fraction of genomic DNA and 
particular sequences as outlined below. 

i. Genomic DNA. Genomic DNA contains all of the nucleic acid sequences of the organism in proportion to their 
copy-number in the genome. Thus, adding genomic DNA to the hybridization mixture increases the concentration 

'5 of the high-copy repeat sequences more than low-copy sequences, and therefore is more effective at blocking the 

former. 

ii. High-copy fraction of genomic DNA. Fractionating the genomic DNA to obtain only the high-copy sequences 
and using them for blocking can be done, for example, with hydroxyapatite as described below. 

20 Removal of Sequences . 

[01 29] Hydroxyapatite . Single- and double-stranded nucleic acids have different binding characteristics to hydroxya- 
patite. Such characteristics provide a basis commonly used for fractionating nucleic acids. Hydroxyapatite is commer- 
ically available [e.g. : BioRad Laboratories, Hercules, CA (USA)]. The fraction of genomic DNA containing sequences 

25 with a particular degree of repetition, from the highest copy-number to singie-copy, can be obtained by denaturing 
genomic DNA, allowing it to reassociate under appropriate conditions to a particular value of C Q t, followed by separation 
using hydroxyapatite. The single- and double-stranded nucleic acid can also be discriminated by use of S1 nuclease. 
Such techniques and the concept of C D t are explained in Britten et al., "Analysis of Repeating DNA Sequences by 
Reassociation", in Methods in Enzymology, 29 : 363-41 8 (1 974). 

30 [0130] Reaction with immobilized nucleic acid. Removal of particular sequences can also be accomplished by at- 
taching single-stranded "absorbing" nucleic acid sequences to a solid support. Single-stranded source nucleic acid is 
hybridized to the immobilized nucleic acid. After the hybridization, the unbound sequences are collected and used in 
CGH. For example, human genomic DNA can be used to absorb repetitive sequences from the subject nucleic acids. 
One such method is described by Brison et al., "General Method for Cloning Amplified DNA by Differential Screening 

35 with Genomic Probes," Molecular and Cellular Biology, 2 : 578-587 (1 982). Briefly, minimally sheared human genomic 
DNA is bound to diazonium cellulose or a like support. The source DNA, appropriately cut into fragments, is hybridized 
against the immobilized DNA to C Q t values in the range of about 1 to 1 00. The preferred stringency of the hybridization 
conditions may vary depending on the base composition of the DNA. 

[0131] Prehybridization . Blocking of repeat sequence binding sites in the reference genome by hybridization with 
40 unlabeled complementary sequences will prevent binding of labeled sequences in the subject nucleic acids that have 
the potential to bind to those sites. For example, hybridization with unlabeled genomic DNA will render the high-copy 
repetitive sequences in the reference genome double-stranded. Labeled copies of such sequences in the subject 
nucleic acids will not be able to bind when they are subsequently applied. 

[01 32] In practice, several mechanisms can be combined to produce the desired contrast and sensitivity 

45 

Labeling the Nucleic Acid Fragments of the Subject Nucleic Acids 

[0133] There are many techniques available for labeling single- and double-stranded nucleic acid fragments of the 
subject nucleic acids. They include incorporation of radioactive labels : e.g. Harper et al. Chromosoma, 83: 431-439 

50 (1984); direct attachment of fluorochromes or enzymes, e.g. Smith et al., Nuc. Acids Res., 13 : 2399-2412 (1985), and 
Connolly et al, Nuc. Acids Res., 13 : 4485-4502 (1 985); and various chemical modifications of the nucleic acid fragments 
that render them detectable immunochemically or by other affinity reactions, e.g. Tchen et al.. "Chemically Modified 
Nucleic Acids as Immunodetectable Probes in Hybridization Experiments/' PNAS, 81: 3466-3470 (1984); Richardson 
et al., "Biotin and Fluorescent Labeling of RNA Using T4 RNA Ligase/' Nuc. Acids Res., 11 : 6167-6184 (1983); Langer 

55 et al., "Enzymatic Synthesis of Biotin-Labeled Polynucleotides: Novel Nucleic Acid Affinity Probes," PNAS, 78 : 
6633-6637 (1981); Brigati et al. , "Detection of Viral Genomes in Cultured Cells and Paraffin-Embedded Tissue Sections 
Using Biotin-Labeled Hybridization Probes," Virol., 126 : 32-50 (1983); Broker et al. : "Electron Microscopic Visualization 
of tRNA Genes with Ferritin-Avidin: Biotin Labels," Nuc. Acids Res., 5 : 363-384 (1978); Bayer et al., "The Use of the 



on 



EP 0 631 635 B1 



Avidin Biotin Complex as a Tool in Molecular Biology," Methods of Biochem. Analysis, 26: 1-45 (1980); Kuhlmann, 
Immunoenzyme Techniques in Cytochemistry (Weinheira Basel, 1984). Langer-Safer et al. p PNAS (USA), 79: 4381 
(1982): Landegent et al., Exp. Cell Res ., 153: 61 (1984); and Hopman et a!., Exp. Cell Res ., 169 : 357 (1987). Thus, 
as indicated, a wide variety of direct and/or indirect means are available to enable visualization of the subject nucleic 

5 sequences that have hybridized to the reference genome. Suitable visualizing means include various ligands, radio- 
nuclides, fluorochromes and other fluorescers, chemiluminescers, enzyme substates or co-factors, particles, dyes and 
the like. Some preferred exemplary labeling means include those wherein the probe fragments are biotinylated, mod- 
ified with N-acetoxy-N-2-acetylaminofluorene, modified with fluorescein isothiocyanate or other fluorochromes, modi- 
fied with mercury/TNP ligand, sulfonated, digoxigeninated or contain T-T dinners. 

10 [0134] A preferred method of labeling is tailing by terminal transferase labeling. Another preferred method is random 
priming with mixed sequence primers followed by polymerase extension. This has the additional feature of amplifying 
the amount of subject DNA, if several cycles are used, which is useful when only a small amount of DNA was originally 
obtained from the subject cell or cell population. 

[0135] The key feature of labeling is that the subject nucleic acid fragments bound to the reference spread be de- 
15 tectable. In some cases, an intrinsic feature of the subject nucleic acid, rather than an added feature, can be exploited 
for this purpose. For example, antibodies that specifically recognize RNA/DNA duplexes have been demonstrated to 
have the ability to recognize probes made from RNA that are bound to DNA targets [Rudkin and Stoltar, Nature, 265 : 
472-473 (1977)]. The RNA used is unmodified. Nucleic acid fragments can be extended by adding "tails" of modified 
nucleotides or particular normal nucleotides. When a normal nucleotide tail is used, a second hybridization with nucleic 
20 acid complementary to the tail and containing fluorochromes, enzymes, radioactivity, modified bases, among other 
labeling means, allows detection of the bound nucleic acid fragments. Such a system is commercially available from 
Enzo Biochem [Biobridge Labeling System; Enzo Biochem Inc., New York, N.Y. (USA)]. 

[01 36] Another example of a means to visualize the bou nd nucleic acid fragments wherein the nucleic acid sequences 
do not directly carry some modified constituent is the use of antibodies to thymidine dimers. Nakane et al., ACTA 
25 Histochem. Cytochem., 20 (2):229 (1987), illustrate such a method wherein thymine-thymine dimerized DNA (T-T DNA) 
was used as a marker for in situ hybridization. The hybridized T-T DNA was detected immunohistochemically using 
rabbit anti-T-T DNA anlibody. 

[0137] All of the labeling techniques disclosed in the above references may be preferred under particular circum- 
stances. Further, any labeling techniques known to those in the art would be useful to label the subject nucleic acids 
30 in of this invention. Several factors govern the choice of labeling means, including the effect of the label on the rate of 
hybridization and binding of the nucleic acid fragments to the chromosomal DNA, the accessibility ot the bound nucleic 
acid fragments to labeling moieties applied after initial hybridization, the mutual compatibility of the labeling moieties, 
the nature and intensity of the signal generated by the label, the expense and ease in which the label is applied, and 
the like. 

35 [0138] Several different subject nucleic acids, each labeled by a different method, can be used simultaneously. The 
binding of different nucleic acids can thereby be distinguished, for example, by different colors. 

In Situ Hybridization. 

40 [0139] Application of the subject nucleic acids to the reference chromosome spreads is accomplished by standard 
in situ hybridization techniques. Several excellent guides to the technique are available, e.g. ; Gall and Pardue, "Nucleic 
Acid Hybridization in Cytological Preparations," Methods in Enzymology, 21: 470-480 (1981); Henderson, "Cytological 
Hybridization to Mammalian Chromosomes,'' International Review of Cytology, 76 : 1-46 (1982); and Angerer et al., M in 
situ Hybridization to Cellular RNAs," in Genetic Engineering : Principles and Methods, Setlow and Hollaender, Eds., 

^5 Vol. 7, pgs. 43-65 (Plenum Press, New York, 1985). 

[01 40] Generally in situ hybridization comprises the following major steps: (1 ) fixation of tissue or biological structure 
to be examined, (2) prehybridization treatment of the biological structure to increase accessibility of target DNA, and 
to reduce nonspecific binding, (3) hybridization of the mixture of nucleic acids to the nucleic acid in the biological 
structure or tissue; (4) posthybridization washes to remove nucleic acid fragments not bound in the hybridization and 

50 (5) detection of the hybridized nucleic acid fragments. The reagents used in each of these steps and their conditions 
of use vary depending on the particular situation. 

[0141] Under the conditions of hybridization wherein human genomic DNA is used as an agent to block the hybrid- 
ization capacity of the repetitive sequences, the preferred size range of the nucleic acid fragments is from about 200 
bases to about 1000 bases, more preferably about 400 to 800 bases lor double-stranded, nick-translated nucleic acids 
55 and about 200 to 600 bases for single-stranded or PCR adapter-linker amplified nucleic acids. 

[0142] Example 1 provides details of a preferred hybridization protocol. Basically the same hybridization protocols 
as used for chromosome-specific painting as described in Pinkel et al., PNAS (USA), 85 : 9138-9142 (1988) and in EP 
Pub. No. 430,402 (published June 5, 1991) are adapted for use in CGH. 
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[0143] The following representative examples of performing CGH methods of this invention are for purposes of il- 
lustration only and are not meant to limit the invention in any way. 

Example 1 

DNA from Breast Cancer Lines Hybridized to Normal Metaphase Spreads 



[01 44] In this Example, methods of this invention to analyse genomes by Comparative Genomic Hybridization (CGH) 
are exemplified by hybridizations of breast cancer cell lines to normal metaphase spreads. The target metaphase 
spreads were pre-hybridized with unlabeled human placental DNA to block the high copy repeat sequences. In this 
representative example, the hybridization mixture containing the extracted labeled DNA from the cell lines contained 
unlabeled, repeat-enriched Cot-1 blocking DNA [obtained from Bethesda Research Laboratories (BRL), Gaithersburg, 
MD (USA]. 

[0145] The experiments outlined below include in the hybridization mixture for the subject genomes, that is, the 
breast cancer cell line DNAs, chromosome-specific repeat sequence probes and chromosome-specific painting probes. 
Those probes labeled with biotin were included as an adjunct for identifying chromosomes in the metaphase prepara- 
tions. The experiments were first performed without those chromosome-specific probes. Then each chromosome of 
interest was measured to determine its length which was considered along with other factors to determine its probable 
identity. The chromosome-specific probes were then used in the hybridization mixture to confirm the identity of the 
chromosome of interest. However, such probes are not necessary as the chromosomes could have been identified by 
the DAPI banding of the counterstain or by other chemical staining, such as staining with quinacrine, by a skilled 
cytogeneticist. 

Cell Lines and Isolation of DNA : 

[0146] Six established breast cancer cell lines: BT-474, SK-BR-3 : MCF-7, MDA-MB-361 , MDA-MB-468 and T-47D 
were obtained from the American Type Culture Collection [Rockville, Maryland (USA)]. The breast cancer cell line 
600MPE cell line was kindly provided by Dr. Helene S. Smith [Geraldine Brush Cancer Research Center San Francisco, 
CA (USA)]. Cell lines were grown until they became confluent. Cells were then trypsinized, pelleted by centrifugation 
at 1500 RPM for 5 minutes and washed twice in phosphate buffered saline. The DNA was then isolated as described 
by Sambrook et al., Molecular Cloning: A Laboratory Manual, Vol. 2: 9.1 6-9.1 9 [Cold Spring Harbor Laboratory Press, 
Cold Spring Harbor, NY (USA) 1989]. 

[0147] Details concerning the established human breast cancer cell lines used herein are as follows: 

BT-474 - Originated from a human primary cancer; obtained from the ATCC, catalog # HTB 20: 

SK-BR-3 - Originated from a human metastatic breast adenocarcinoma derived from a pleural effusion; obtained 

from the ATCC catalog # HTB 30; 
MDA-MB-361 - Originated as a metastatic tumor to the brain; obtained from the ATCC, catalog # HTB 27; 
MCF-7 - Originated from a human metastatic pleural effusion; obtained from the ATCC, catalog # HTB 22; 

T-47D - Originated as a human metastatic pleural effusion; obtained from the ATCC catalog # HTB 133; 

600MPE - Originated as a human metastatic pleural effusion; kindly provided by Dr. Helene S. Smith [Geraldine 

Brush Cancer Research Center, San Francisco, CA (USA)]; and 
MDA-MB-468 - Originated as a metastatic pleural effusion; obtained from the ATCC, catalog # HTB 132. 

Preparation of Normal Lymphocyte Metaphases : 



[01 48] Normal peripheral blood lymphocytes were stimulated by PHA, synchronized by methotrexate treatment and 
blocked in metaphase using 0.05 ug/ml colcemid. Cells were then centrifuged : washed and incubated in 75 mM KCI 
at 37°C for 15 minutes. Cells were then fixed in methanohacetic acid (3:1) and dropped onto slides. The slides were 
stored under nitrogen at -20°C. 

DNA Labeling : 



[0149] Cell line DNAs were labeled with digoxigenin-11-dUTP using nick translation [Rigby et al., J. Mol. Biol ., 113 : 
237 (1 977); Sambrook et al., supra ]. The optimal size of the probe fragments after nick translation and before denaturing 
was 400-800 bps. As indicated above ■ chromosome-specific probes were used in dual-color hybridizations to verify 
the identification of chromosomes of interest in the metaphase spreads. Representative examples of such chromo- 
some-specific reference probes labeled with biotin-14-dATP include the following: 
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1) a chromosome-specific painting probe for chromosome 20 prepared by the PCR adapter-linker method as de- 
scribed in PCY/US90/00434 published August 9, 1990; 

2) a chromosome 17 peri-centromeric repeat probe (cosmid CK17.10) isolated by Anne Kallioniemi from a chro- 
mosome 17 cosmid library from Los Alamos National Laboratory [Albuquerque, New Mexico (USA)]; an equivalent 

5 chromosome-specific repeat sequence probe for chromosome 1 7 is commercially available from Oncor [Gaithers- 

burg, MD(USA)]; and 

3) an alpha satellite repeat probe specific for the centromeric region of chromosome 8 [kindly provided by Dr. 
Heinz-Ulrich G. Weier; University of California Medical Center, Lab for Cell Analysis, San Francisco, CA (USA)]; 
that probe was generated by Dr. Weier using PCR with primers WA1 and WA2 as described in Weier et al., Hum. 

10 Genet., 87 : 489-494 (1991). 

[01 50] Ones skilled in the art recognize that there are many other equivalent probes available that could be used for 
the confirmation purposes described. For example, whole chromosome painting probes are now available for all the 
human chromosomes [Collins et al., Genomics, 11: 997-1006 (1 991)]. Also available are repeat sequence probes that 
*5 hybridize intensely and specifically to selected chromosomes [Trask et al. ; Hum. Genet, 78 : 251 (1 988) and references 
cited therein]. 

Pretreatment and Prehybridization of Slides : 

20 [0151] Lymphocyte metaphase preparations were first denatured in 70% formamide/2XSSC (1XSSC is 0.15 M NaCI, 
0.015 M NaCitrate), pH 7, at 70°C for 2 minutes and dehydrated in a sequence of 70%, 85% and 100% ethanol. The 
slides were then air dried ancUreated with 10 ug/50 ml Proteinase K [Boehringer Mannheim GmbH, Indianapolis IN 
(USA)] for 7.5 minutes at 37°C in a buffer containing 20 mM Tris and 2 mM CaCI 2 (pH 7.5). Ethanol dehydration was 
then done as described above, and the slides were prehybridized with ten ul of a hybridization mixture, consisting of 

25 20 ug unlabeled human placental DNA [obtained from Sigma, St. Louis, MO (USA); size of the fragments is 200-700 
bps] in 50% formamide, 10% dextran sulphate and 2XSSC (pH 7) for 60 minutes at 37°C. Before the prehybridization 
mixture was applied to the slides, it was denatured in a 70°C water bath for 5 minutes. After prehybridization , the slides 
were washed once in 2XSSC and dehydrated with ethanol as described above. 

30 Hybridization : 

[0152] Five ug of unlabeled, repeat-enriched Cot-1 blocking DNA [BRL, Gaithersburg : MD (USA)] and 60 ng of dig- 
oxigenin labeled cell line DNA and 20-60 ng of biotin-labeled reference probes (for verification of chromosome identi- 
fication) were mixed together and 1/10 vol of 3M Na-acetate was added. DNA was precipitated by adding 2 volumes 

35 of 100% ethanol followed by centrifugation in a microcentrifuge for 30 minutes at 15,000 RPM. Ethanol was removed 
and the tubes were allowed to dry until all visible ethanol had evaporated. Ten ul of hybridization buffer consisting of 
50% formamide, 10% dextran sulphate and 2XSSC (pH 7) was then added, followed by careful mixing. DNAs in the 
hybridization buffer were then denatured for 5 minutes at 70°C followed by a 60 minute renaturation at 37°C. The 
hybridization mixture was then added to the prehybridized lymphocyte metaphase slides. Hybridization was carried 

40 out under a coverslip in a moist chamber for 3-4 days at 37°C. 

Immunofluorescent Probe Detection : 

[0153] The slides were washed three times in 50% formamide/ 2XSSC, pH 7, twice in 2XSSC and once in 0.1XSSC 
^5 for 10 minutes each at 45°C. After washing, the slides were immunocytochemically stained at room temperature in 
three steps (30-45 minutes each). Before the first immunocytochemical staining, the slides were preblocked in 1% 
BSA/4XSSC for 5 minutes. The first staining step consisted of 2 ug/ml Texas Red-Avidin [Vector Laboratories, Inc., 
Burfingame, CA (USA)] in 1% BSA/4XSSC. The slides were then washed in 4XSSC, 4XSSC/0.1% Triton X-100, 
4XSSC ; and PN (a mixture of 0.1 M NaH 2 P0 4 and 0.1 M Na 2 HP0 4: pH 8, and 0.1% Nonidet P-40) for 10 minutes each 
50 and preblocked with PNM (5% Carnation dry milk, 0.02% Na-azide in PN buffer) for 5 minutes. The second antibody 
incubation consisted of 2 ug/ml FITC-conjugated sheep anti-digoxigenin [Boehringer Mannheim GmBH, Indianapolis, 
IN (USA)] and 5 ug/ml anti-avidin [Vector Laboratories, Buriingame, CA (USA)] in PNM followed by three PN washes, 
10 minutes each. After the PNM block, the third immunochemical staining was done using rabbit anti-sheep FITC 
antibody (1:50 dilution) (Vector Laboratories) and 2 ug/ml Texas Red-Avidin in PNM. After three PN washes, nuclei 
55 were counterstained with 0.8 uM 4,5-diamidino-2-phenylindole (DAPI) in an antifade solution. 
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Fluorescence Microscopy and Interpretation of Results : 

[0154] A Nikon fluorescence microscope [Nikon Inc., Garden City, NY (USA)] equipped with a double band pass 
filter [Chroma Technology, Brattleboro, VT (USA)] and a 1 00X objective was used for simultaneous visualization of the 

5 FITC and Texas Red signals. Hybridization of the breast cancer cell line DNAs was seen as a more or less uniform 
faint green background staining of all metaphase chromosomes with the exception of the Y-chromosome. As the breast 
cancer cell lines are of course of female origin, they did not contain Y chromosomal DNA. The absence of said green 
staining of the Y chromosome of the metaphase spread is exemplary of the manner in which a cytogenetically significant 
deletion would be visualized. The absence of the Y chromosome in the breast cancer cell tine DNA was detected, as 

10 would a cytogenetically significant deletion, by the hybridization wherein the Y chromosome of the reference spread 
was only stained by the DAP I counterstain. Using a fluorescence microscope, amplified sequences can be seen as 
bright green dots or bands along the chromosome arms. 

[0155] To facilitate the display of the results and to improve the sensitivity of detecting small differences in fluores- 
cence intensity, a digital image analysis system (QUIPS) was used. QUIPS (an acronym for quantitative image process- 
's ing system) is an automated image analysis system based on a standard Nikon Microphot SA [Nikon Inc., Garden City, 
NY (USA)] fluorescence microscope equipped with an automated stage, focus control and filterwheel [Ludl Electronic 
Products Ltd., Hawthorne, NY (USA)]. The filterwheel is mounted in the fluorescence excitation path of the microscope 
for selection of the excitation wavelength. Special fitters [Chroma Technology, Brattleboro, VT (USA)] in the dichroic 
block allow excitation of multiple dyes without image registration shift. The microscope has two camera ports, one of 
20 which has an intensified CCD camera [Quantex Corp., Sunnyvale, CA (USA)] for sensitive high-speed video image 
display which is used for finding interesting areas on a slide as well as for focusing. The other camera port has a cooled 
CCD camera [model 200 by Rhotometrics Ltd. : Tucson, AZ (USA)] which is used for the actual image acquisition at 
high resolution and sensitivity. 

[0156] The cooled CCD camera is interfaced to a SUN 4/330 workstation [SUN Microsystems Inc., Mountain View, 
25 CA (USA)] through a VME bus. The entire acquisition of multicolor images is controlled using an image processing 
software package SCIL-Image [Delft Centre for Image Processing, Delft, Netherlands]. Other options for controlling 
the cameras., stage, focus and filterwheel as well as special programs for the acquisition and display of multicolor 
images were developed at the Division of Molecular Cytometry [University of California, Medical Center; San Francisco, 
CA (USA)] based on the SCIL-Image package. 
30 [01 57] To display the results of the comparative hybridization, two or three consecutive images were acquired (DAPI, 
FITC and Texas Red) and superimposed. The FITC image was displayed after using the thresholding and contrast 
enhancement options of the SCIL-Image software. Exercising such options reduces the overall chromosomal fluores- 
cence to make amplified sequences more readily visible. For example, using thresholding and contrast stretching, it 
was possible to enhance the contrast and quantification between the faint green background staining and staining 
35 originating from the amplified sequences in the cell lines. Alternatively, to facilitate the detection of deletions, it is 
possible to increase the overall chromosomal fluorescence and make areas of reduced fluorescence appear darker. 
The red color was used for reference probes to help in the identification of chromosomes. 

[0158] After identification of the chromosomes based on the use of reference probes in a dual-color hybridization, a 
site of amplification was localized by fractional length measurements along the chromosome arm (fractional length ~ 
^0 distance of the hybridization signal from the p-telomere divided by the total length of the chromosome). The band 
location of the signal was then approximated from the fractional length estimate based on the ISCN 1985 idiograms 
[Harnden and Klinger, An International System for Cytogenetic Nomenclature , Karger Ag, Basel, Switzerland (1985)]. 

Results : 

45 

[01 59] The results from the hybridizations are compiled in Table 2 along with other information known about the cell 
lines. Amplification at 17q12 (erbB-2 locus) and approximately 8q24 (MYC tocus) was seen in lines showing amplifi- 
cation of erbB-2 and MYC whenever the level of amplification was greater than about five- to ten-fold using this CGH 
method. In addition, amplification of several megabase wide regions was seen in three cell lines at 17q22-23 and in 
50 three lines at 20qter; those amplifications were previously unknown sites of amplification and were not expected from 
other studies. For example, as indicated in Table 2, the BT-474 cell line is known to have a 1 3-fold c-erbB-2 amplification; 
CGH revealed amplified sequences at the following loci: 17q12 (the erbB-2 locus); 17q22-q23 and 20q13-ter. The latter 
two sites were previously unrecognized sites of amplification in that cell line. 

[0160] All lines showing amplification showed amplification at more than one site. Evidence for co-amplification may 
55 be clinically important since co-amplification has been observed previously [van de Vijver et a!., Mol. Celt Biol. 7 : 
2019-2023 (1987): Saint-Ruf et aL Oncogene, 6 : 403-406 (1991)], and is sometimes associated with poor prognosis 
[Borg et at., Br. J. Cancer, 63 : 136-142 (1991)]. Amplification at 17q22-23 has also been seen using probe DNA from 
primary tumors. 
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TABLE 2 



Results of Testing Breast Cancer Cell Lines for Amplification 



Cell Line 


Origin 


Growth rate; -Td 


Hormone 


Known 
drnpiiHCaiion 
(level) 


Amplification 
ueiecieo Dy obn 


BT-474 


Primary Cancer 


48-96 hr 


+/- 


erbB-2 (13X) 


17q1 2 (erbB-2), 
t /q^-^o, ^uqter 


SK-BR-3 


Pi. Effusion 


? 




erbB-2 (9X) MYC 
(10X) 


17q 12 (erbB-2), 
8q21, 8q23-24.1 
(MYC),20qter 


MDA-MB-361 


Brain met. 


<96hr 


-/+ 


erbB-2 (4X) 


17q22-23 


MCF-7 


PI. Effusion 


<48hr 


+/+ 


erbB-2 (none) 


17q22-23,20qter 


T-47D 


PI. Effusion 


? 


+/+ 


erbB-2 (none) 


None 


600MPE 


PI. Effusion 


? 


? 


erbB-2 (none) 


None 


MDA-MB-468 


PI. Effusion 


? 


? 


erbB-2 (none) 


None 



20 Example 2 

[0161] Hybridizations with two different labeled subject DNAs as schematically outlined in Figures 5 and 6 were 
performed. One of the labeled subject DNAs hybridized was a cell line DNA as described in Example 1 and similarly 
labeled. The other labeled subject DNA was human genomic DNA labeled with biotin-14-dATP. 
25 [0162] The protocols were essentially the same as in Example 1 except that no chromosome-specific reference 
probes were used, and the same amount of the labeled human DNA as the labeled cell line DNA, that is, 60 ng, was 
hybridized. Of course, reference probes could be added to the hybridization mixture, but they need to be differently 
labeled to be distinguishable. 

[0163] The results showed the normal DNA with a red signal and the cell line DNA with a green signal. The green 

30 to red ratios were determined along each chromosome. Amplification was indicated by an area where the signal was 
predominantly green whereas deletions were indicated by more red signals than in other areas of the chromosomes. 
[0164] Exemplary, CGH results using breast cancer cell line 600MPE DNA and normal hirman DNA were as follows. 
As indicated above, the hybridization was performed using 5 ug Cot-1 DNA, 60 ng of digbxigenin labeled 600MPE cell 
line DNA, and 60 ng of btotinylated norma! human genomic DNA. The 600MPE DNA was detected with FITC (green) 

35 and the genomic DNA with Texas Red-Avidin (red). 

[0165] The 600MPE breast cancer cell line, the karyotype for which was published by Smith et al., JNCI, 78: 611-615 
(1987), contains one normal chromosome 1 and three marker chromosomes with chromosome 1 material in them: t 
(1 q: 1 3q), 1 p(p22) and inv(1 )(p36q21 ). Thus, the cell line is disomic for the p-telomere-p22, trisomic for p22-centromere 
and tetrasomic for the q-arm of chromosome 1. 

40 [01 66] The comparative genomic hybridizations of this example apparently identified three different regions on chro- 
mosome 1 that could be separated according to the intensities of green and red colors. The q-arm of chromosome 1 
had the highest intensity of green color (tumor DNA). The region from band p22 to the centromere was the second 
brightest in green, and the area from the p-telomere to band p22 had the highest intensity of red color (normal DNA). 
Those hybridization results were consistent with the traditional cytogenetic analyses of that cell line stated immediately 

45 above. 

[0167] However, further studies with CGH, as presented in Example 3, indicated that the CGH analysis of this ex- 
ample, as well as the published karyotype, were partially in error. The CGH analysis of Example 3 motivated additional 
confirmatory experiments : as described therein, leading to correction of . the original CGH results and the published 
karyotype. 

50 

Example 3 

Copy Number Karyotypes of Tumor DNA 

55 [01 68] In the representative experiments of CGH in this example, biotinylated total tumor DNA (cell line and primary 
tumor DNA) and digoxigenin-labeled normal human genomic DNA are simultaneously hybridized to normal human 
metaphase spreads in the presence of unlabeled blocking DNA containing high-copy repetitive sequences, specifically 
unlabeled Cot-1 blocking DNA [BRL, Gaithersburg, MD (USA)). The following paragraphs detail the procedures used 
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for the representative CGH experiments of this example. 
DNA Labeling : 

[0169] DNAs used in this example were labeled essentially as shown above in Example 1 . DNAs were labeled with 
biotin-14-dATP or digoxigenin-11-dUTP by nick translation [Rigby et aL, supra ; Sambrook et al., supra ]. The optimal 
size for double stranded probe fragments after labeling was 600-1000 bp. 

Pretreatment of Metaphase Spreads : 

[0170] Lymphocyte metaphase preparations were denatured, dehydrated and air dried, treated with Proteinase K 
and dehydrated again as described in Example 1 . 

Comparative Genomic Hybridization : 

[01 71] Sixty ng of biotinylated test DNA, 60 ng of digoxigenin-labeled normal DNA and 5 ng of unlabeled CoM DNA 
(BRL) were ethanol precipitated and dissolved in 10 p.1 of 50% formamide, 10% dextran sulfate, 2xSSC, pH 7. The 
probe mixture was denatured at 70°C for 5 minutes, allowed to reanneal at 37°C for 60 minutes and hybridized to 
normal male metaphase chromosomes for 3-4 days at 37°C. 

Immunofluorescent Probe Detection: 



[0172] The slides were washed as described above in Example 1 , and immunocytochemically stained at room tem- 
perature in three thirty-minute steps: (I) 5 (ig/ml FITC-Avidin [Vector Laboratories, Inc., Burlingame : CA (USA)) and 2 
25 pg/ml anti-digoxigenin-Rhodamine (Boehringer Mannheim GMbH); (II) 5 p.g/ml anti-avidin (Vector Laboratories); and 
(III) 5 u.g/m) FITC-avidin. Nuclei were counterstained with 0.8 u.M 4,5-diamino-2-phenylindole (DAPI) in antifade solu- 
tion. A Zeiss fluorescence microscope equipped with a double band pass filter [Chroma Technology, Brattleboro, VT 
(USA)] was used for simultaneous visualization of FITC and rhodamine signals. 

30 Digital Image Analysis System and Fluorescence Ratio Profiles 

[0173] The QUIPS system essentially a described above in Example 1 was used to analyse quantitatively the fluo- 
rescence signals. Fluorescence ratio profiles along the chromosomes were extracted using WOOL2 software package 
[developed at MRC, Edinburgh, Scotland] as follows: the DAPI image is used to set the morphological boundary of 

35 each chromosome by thresholding. The chromosome outline is smoothed by a n number of opening and closing op- 
erations, a modified Hilditch skeleton is calculated and taken to represent the medial axis of the chromosome. The 
DAPI image is expanded outwards in all directions until the intensity field levels off (when background is reached) or 
begins to rise (due to an adjacent chromosome). The intensity profile of each image along the medial axis and within 
the expanded DAPI image is then calculated by summing the green and red fluorescence pixel values along the se- 

4£> quence of lines perpendicular to and spaced at unit distance along the medial axis. Modal green and red intensity 
values corresponding to the expanded DAPI image are taken to represent the background fluorescence and used as 
the intensity origin. 



Cell Lines : 
[0174] 

5637 - Originated from a human primary bladder carcinoma; obtained from ATCC, catalog U HTB 9 

50 SK-BR-3 - Originated from a human metastatic breast adenocarcinoma, derived from a pleural effusion; ob- 
tained from the ATCC, catalog # HTB 30 

Colo 205 - Originated from a human colon adenocarcinoma; obtained from the ATCC : catalog it CCL 222 

55 NCI-H508 - Originated from a human cecum adenocarcinoma; obtained from the ATCC, catalog # CCL 253 

SW480 - Originated from a human colon adenocarcinoma; obtained from the ATCC : catalog U CCL 228 



9K 



EP 0 631 635 B1 



SW620 - Originated from a human lymph node metatasis of a colon adenocarcinoma; obtained from the 

ATCC, catalog # CCL 227 

WiDr - Originated from a human colon adenocarcinoma; obtained from the ATCC, catalog # CCL 218 

SK-N-MC - Originated from a human neuroblastoma (metastasis to supra-orbital area); obtained from the ATCC, 

catalog # HTB 10 

CaLu3 - Originated from a human lung adenocarcinoma, derived from a pleural effusion; obtained from the 

ATCC, catalog # HTB 55 

CaLu6 - Originated from a human anaplastic carcinoma, probably lung; obtained from the ATCC, catalog # 

HTB 56 

NCI-H69 - Originated from a human small cell lung carcinoma; obtained from the ATCC, catalog # HTB 119 

COLO 320HSR - Originated from a human colon adenocarcinoma; obtained from the ATCC, catalog # 220.1 

600 PE - Originated from a human breast carcinoma; obtained from Dr. Helene Smith and Dr. Ling Chen 

[Geraldine Brush Cancer Research Center, San Francisco, CA (USA)]. This is the same as the 600 
MPE cell line described in Examples 1 and 2. 

BT-20 - Originated from a human breast carcinoma; obtained from ATCC, catalog # HTB 1 9 

[0175] The following are five fibroblast cell lines with total chromosomal number and X chromosomal number in 
parentheses, which were obtained from the NIGMS repository [Camden, NJ (USA)]: 

GM01723(45,XO) 
GM08399 (46,XX) 
GM04626 (47.XXX) 
GM01415E(48,XXXX) 
GMO5009B (49.XXXXX). 

* Results and Discussion : 

[0176] Demonstrated herein is CGH's capability of detecting and mapping relative DNA sequence copy number 
between genomes. A comparison of DNAs from malignant and normal cells permits the generation of a "copy number 
karyotype" for a tumor, thereby identifying regions of gain or loss of DNA. 

[0177] Demonstrated is the use of dual color fluorescence in situ hybridization of differently labeled DNAs from a 
subject tumor genome and a normal human genome to a normal human metaphase spread to map DNA sequence 
copy number throughout the tumor genome being tested. Regions of gain or loss of DNA sequences, such as deletions, 
duplications or amplifications, are seen as changes in the ratio of the intensities of the two fluorochromes (used in this 
representative example) along the target chromosomes. Analysis of tumor cell lines and primary bladder tumors iden- 
tified 16 different regions of amplification, many in loci not previously known to be amplified. Those results are shown 
in Table 3 below. 

[0178] The tumor DNA is detected with the green fluorescing FlTC-avidin : and the norma! DNA with the red fluo- 
rescing rhodamine anti-digoxigenin. The relative amounts of tumor and normal DNA bound at a given chromosomal 
locus are dependent on the relative abundance of those sequences in the two DNA samples, and can be quantitated 
by measurement of the ratio of green to red fluorescence. The normal DNA in this example serves as a control for 
local variations in the ability to hybridize to target chromosomes. Thus, gene amplification or chromosomal duplication 
in the tumor DNA produces an elevated green-to-red ratio, and deletions or chromosomal loss cause a reduced ratio. 
The Cot-1 DNA included in the hybridization inhibits binding of the labeled DNAs to thecentromeric and heterochromatic 
regions so those regions are excluded from the analysis. 

[0179] The fluorescence signals were quantitatively analyzed by means of a digital image analysis system as de- 
scribed above. A software program integrated the green and red fluorescence intensities in strips orthogonal to the 
chromosomal axis, subtracted local background, and calculated intensity profiles for both colors and the green-to-red 
ratio along the chromosomes. 

[0180] The ability of CGH to quantitate changes in sequence copy number that affect an entire chromosome was 
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tested with the above-listed five fibroblast cell lines having 1 to 5 copies of the X chromosome and two copies of each 
autosome. Hybridization of DNA from the 45.XO cell line (in green) together with normal female DNA (in red) resulted 
in a uniform green-red staining of the autosomes whereas the X chromosome appeared more red. (The reference 
spread as indicated above was of normal male chromosomes. Some faint staining of a small part of the Y chromosome 

5 was the result of the binding of homologous sequences in the pseudo-autosomal region.) 

[0181] Hybridizations with DNA from cell lines carrying 2. 3, 4 or 5 copies of the X chromosome resulted in an in- 
creasingly strong green fluorescence from the X chromosome in relation to the autosomes. The average gree^to-red 
fluorescence ratio of the X chromosome (Figure 7), when normalized to the average ratio for the autosomes within the 
same metaphase spread, increased linearly with the increasing number of X chromosomes [correlation coefficient (r) 

10 = 0.978]. Thus, CGH can quantitatively distinguish a change of plus or minus one copy of a chromosome at least up 
to 4 copies. 

[0182J Experiments showed that CGH could generate a complete copy number karyotype for a near-diploid breast 
cancer cell line, 600PE. According to the published karyotype for 600PE [Smith et al., JNCI, 78: 611 (1987)], 600PE 
is near-diploid with five marker chromosomes having four copies of the q-arm of chromosome 1 , monosomy 16, and 

* 5 deletions of 9p, 11 q and 17p. CGH using biotinylated 600PE DNA (in green) and normal digoxigenin-labeled DNA (in 
red) revealed the following relative copy number changes: gain of 1q and loss of 9p, 16q, 17p and distal 11q. The 
green-to-red ratio profiles for those aberrant chromosomes are shown in Figure 8. Only the q-arm of chromosome 16 
showed decreased relative copy number suggesting that 16p was not deleted. That observation was subsequently 
confirmed by fluorescence in situ hybridization (FISH) to 600PE interphase ceils using cosmid probes for the p- and 

20 q-arms of chromosome 16 [1 6p and 1 6q cosmid probes provided by Los Alamos National Laboratory, Los Alamos, NM 
(USA)]; two signals per nucleus for the 16p cosmid probe and one for the 16q cosmid probe permitted calibration of a 
green-to-red ratio of 1.0 as inchoating two copies of a sequence. 

[0183] Thus, if the absolute copy number of any point in the tumor genome is known, relative copy numbers can be 
converted to actual copy numbers at al! loci. The CGH results differed from the originally published karyotype in the 
25 region of 1 6p and proximal 1 p. That discrepancy was resolved by locus-specific chromosome-specific painting (FISH) 
that indicated that the components of one of the marker. chromosomes had been misinterpreted by conventional cy- 
togenetic analysis. 

[01 84] CGH with DNAs from two fibroblast cell lines [GM05877 and GM01 1 42A from the NIGMS repository] detected 
small interstitial deletions around the RB-1 locus in 13q- del(13) (pter > q14.1 ::q21 .2 > qter) and de!(13) (pter > q14.t :: 
30 q22.1 > qter). On the basis of the CGH analysis and measurement of the deletion size as a fraction of the length of 
chromosome 13 [total length 111 megabases (Mb)], those deletions were estimated to span about 10 and 20 Mb, 
respectively. Thus, it is possible that CGH can be used to screen DNA samples from solid tumors in order to identify 
large physical deletions that may uncover recessive mutant tumor suppressor genes. 

[01 85] CGH was evaluated for its ability to detect increased gene copy number with cell lines that contained previously 
35 reported amplification of oncogenes. Figure 9A shows CGH with DNA from a colon cancer cell line COLO 320HSR, 
known to contain more than a 50-fold amplification of a 300 kb region around the myc oncogene [Kinzku et al., PNAS 
(USA), 83: 1031 (1986)]. The expected high green-to-red ratio at 8q24 corresponding to the location of myc is clear. 
The height of the peak does not quantitatively reflect the level of amplification because the fluorescent signal spread 
over a region of the chromosome that is larger than the length of the amplicon. That is apparently a result of the complex 
to organization of the target DNA in the denatured chromosomes. 

[01 86] The eight-fold amplification of the erbB2 oncogene in the SK-BR-3 breast cancer cell line also was detectable 
with CGH as a hybridization signal at 17q12 (Table 3). High level amplifications such as those also could be detected 
in single color-hybridizations with the use of only labeled tumor DNA. 

[0187] Cytogenetic and molecular studies of primary tumors and cell lines often reveal homogeneously staining 
45 regions and double minute chromosomes that do not involve known oncogenes [Saint-Ruf et al., Genes Chrom. Can- 
cer., 2: 18 (1990); Bruderlein et al., Genes Chrom. Cancer, 2: 63 (1990)]. CGH allows straightforward detection and 
mapping of such sequences. Table 3 contains a summary of the analysis with CGH of 11 cancer cell lines. Data in 
Table 3 is based on the visual inspection of a large number of metaphase spreads and on detailed digital image analysis 
of four to six metaphases for each sample. 

50 
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TABLE 3 



Mapping of amplified sequences in established cancer cell lines and primary tumors by CGH 


Specimen 


Origin 


Amplif. by CGH* 


Cytogenetic evidence of gene amplif. + 


Cell lines: 


5637 


Bladder 


3p25, 6p22 


DM 


SK-BR-3 


Breast 


8q24 (myc), 8q21, 17q12 (erbB2), 
20q13 




Colo 205 


Colorectal 


6p21 , 6q24 




NCI-H508 


Colorectal 


14q12-13 


DM 


SW480 


Colorectal 


8q24.(myc) 


DM 


SW620 


Colorectal 


16q21-23 


HSR 


WiDr 


Colorectal 


8q23-24 (myc) 




SK-N-MC 


Neuroblastoma 


8q24 (myc) 


DM 


Cal_u3 


Small cell lung 


8p12-21, 8qtet, 17q12 (erbB2) 


HSR 


CaLu6 


Small cell lung 


. 13q32-34 




NCI-H69 


Small cell lung 


2p24 (N-myc), 2p21,2q21 




Primary tumors: 


UR140 


Bladder carcinoma 


16q21-22 




UR145 


Bladder carcinoma 


6p22 





* The oncogene most likely involved in Ihis amplification is shown in parentheses. 

4 Cytogenetic information based on the ATCC Catalogue of Cell Lines & Hybridomas (1992). 

DM = double minute chromosomes, HSR - homogeneously staining regions. 



[0188] Sixteen amplified loci were mapped, many at regions of the genome where amplification had not previously 
been suspected. Thus, a large variety of genes may be amplified during cancer initiation and progression. In five of 
the 11 cell lines, more than one locus was amplified. Two or three separate loci on the same chromosome were amplified 
in four cell lines, which suggests a spatial clustering of chromosomal locations that undergo DNA amplification (Table 
3 and Figure 9B). 

[0189] CGH was also applied to identify and map amplified DNA sequences in uncultured primary bladder tumors. 
Of the seven tumors tested, two showed evidence of DNA amplification but the loci were not the same (Table 3). Thus, 
a number of previously unsuspected genomic regions that might contain genes important for cancer progression have 
been identified by CGH. Further studies will elucidate which of those loci contain novel oncogenes and which represent 
coincidental, random DNA amplification characteristic of genomic instability. 

[0190] The detection and mapping of unknown amplified sequences that typically span several hundred kilobases 
(kb) to a few Mb demonstrated the usefulness of CGH for rapid identification of regions of the genome that may contain 
oncogenes. Analogously, detection of deletions may facilitate identification of regions that contain tumor suppressor 
genes. 

[0191] Further studies are necessary to establish to what extent allelic tosses in tumors are caused by physical 
deletions. In clinical specimens, the detection of small copy number differences is more difficult than with cell lines 
because of the admixture of DNA from contaminating normal cells and because of intratumor heterogeneity. As indi- 
cated above, using PCR to prepare tumor DNA from a small number of tumor cells (as a tumor clonal sub-population) 
may assist in resolving that problem. Like RFLP, CGH emphasizes the detection of aberrations that are homogeneous 
in a cell population and averages those that are heterogeneous. 

[01 92] At the current stage of development of CGH, sensitivity is primarily limited by the granularity of the hybridization 
signals in the metaphase chromosomes. Further improvements in sensitivity will be achieved by optimization of the 
probe concentration and labeling, and by the averaging of the green-to-red fluorescence ratios from several metaphase 
spreads. 

[0193] The descriptions of the foregoing embodiments of the invention have been presented for purposes of illus- 
tration and description. They are not intended to be exhaustive or to limit the invention to the precise form disclosed, 
and obviously many modifications and variations are possible in light of the above teachings. The embodiments were 
chosen and described in order to best explain the principles of the invention and its practical application to enable 
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thereby others skilled in the art to best utilize the invention in various embodiments and with various modifications as 
are suited to the particular use contemplated. It is intended that the scope of the invention be defined by the claims 
appended hereto. All references cited herein are incorporated by reference. 



Claims 

1 . A method of comparing copy numbers of unique DNA sequences in a first cell or cell population relative to copy 
numbers of substantially identical sequences in a second cell or cell population, said method comprising the steps 
of: (a) labelling DNA sequences from each cell or cell population with a different label; (b) hybridizing said labeled 
DNA sequences from each cell or cell population to a reference genome under the following conditions: (i) the 
labeled DNA sequences, and/or the reference genome have their repetitive sequences blocked and/or removed, 
if necessary; and <ii) unique DNA sequences in the labeled DNA sequences and unique DNA sequences in the 
reference genome are retained; (c) comparing the intensities of the signals from those labeled DNA sequences, 
if any, which are hybridized to the reference genome, to determine relative copy number of differently labeled DNA 
sequences hybridized to the same position in the reference genome. 

2. The method of claim 1 , wherein the reference genome comprises at least one metaphase chromosome. 

3. The method of claim 1 , wherein the DNA sequences from at least one of said first cell or cell population and said 
second cell or cell population are isolated from tumor cells. 

4. The method of claim 1 , wherein the DNA sequences from at least one of said first cell or cell population and said 
second cell or cell population are isolated from fetal cells. 

5. The method of claim 1 , wherein the DNA is chromosomal DNA or cDNA. 

6. The method of claim 1 , wherein the reference genome is an antenna cell line. 

7. The method of claim 1 : further comprising the step of amplifying the DNA sequences from at least one of said first 
cell or cell population and said second cell or cell population prior to said hybridizing step. 

8. The method of claim 1 , wherein the labeled DNA is directly visualizable. 

9. The method of claim 1 , further comprising rendering the bound, labeled DNA visualizable after said hybridization 
step. 

10. The method of claim 1 , wherein blocking of the repetitive sequences is carried out by including unlabeled copies 
of the repetitive sequences with said labeled DNA sequences. 

11. The method according to claim 1 , wherein at least one of said first cell or cell population and said second cell or 
cell population is derived from a clinical specimen. 

12. The method according to claim 1 1 wherein the hybridization is done in situ s the reference genome comprises ref- 
erence metaphase chromosomes, and the intensities of signals from labeled nucleic acid sequences are compared 
as a function of position in the reference genome. 

13. The method according to claim 1 , wherein the labeled DNA sequences are hybridized to a portion of the reference 
genome. 

14. The method according to claim 1 , wherein said first ceil or cell population and said second cell or cell population 
and said reference genome are from the same species. 

15. The method according to claim 1 , wherein the reference genome is human. 

1 6. The method according to claim 1 , wherein said labeled nucleic acid sequences are labeled with fluoresce^ ligands, 
chemiluminescers, enzyme substrates, enzyme cofactors, particles or dyes. 
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17. The method according to claim 1 , which method further comprises extracting the DNA from said first cell or cell 
population and/or said second cell or cell population from formalin-fixed and/or paraffin-embedded archived tissue 
specimens prior to labelling. 

5 18. The method according to claim 1, wherein the step of comparing the intensities of the signals from the labeled 
DNA sequences comprises determining the ratio of the intensities of the signals as a function of position in the 
reference genome, said method further comprising quantitatively comparing the intensity ratio among different 
locations along the reference genome, said ratio at each location being proportional to the ratio of the copy number 
of the nucleic acid sequence that binds to that location in the first cell or cell population to the copy number of a 

10 substantially identical sequence in the second celt or cell population. 

19. The method according to claim 1, wherein one of the celts or cell populations is a test cell or cell population and 
the other is a normal cell or cell population. 

15 20. The method according to claim 1 , further comprising employing in situ hybridization or Southern analysis to deter- 
mine absolute copy number of unique DNA sequences throughout the genome. 

21. The method according to claim 1, wherein the copy number of unique DNA sequences in more than two celts or 
cell populations is compared. 

20 

22. The method according to claim 1 , wherein the DNA sequences from each cell or cell population are labeled with 
a different fluorochrome label, and the fluorochrome labeled DNA sequences from each cell or cell population are 
hybridized to the reference genome in situ. 

25 23. The method of claim 22, wherein the reference genome comprises metaphase chromosomes. 

24. The method according to claim 22, wherein the reference genome comprises reference metaphase chromosomes, 
and the intensities of signals from labeled DNA sequences are compared as a function of position in the reference 
genome. 

30 

25. The method according to claim 22, wherein the step of comparing the intensities of the signals from the labeled 
DNA sequences comprises determining the ratio of the intensities of the signals as a function of position in the 
reference genome. 

35 26. The method according to claim 25, further comprising quantitatively comparing the intensity ratio among different 
locations along the reference genome, said ratio at each location being proportional to the ratio of the copy number 
of the DNA sequence that binds to that location in the first cell or cell population to the copy number of a substantially 
identical DNA sequence in the second cell or cell population. 

40 27. The method according to claim 22, wherein the copy number of unique DNA sequences in more than two cells or 
cell populations is compared. 

28. A method of comparing copy numbers of unique RNA sequences in a first cell or cell population relative to copy 
numbers of substantially identical sequences in a second cell or cell population said method comprising the steps 

^5 of: (a) labelling RNA sequences from each cell or cell population with a different label; (b) hybridizing said labeled 

RNA sequences from each cell or cell population to a reference genome under the following conditions: (i) the 
labeled RNA sequences, and/or the reference genome have their repetitive sequences blocked and/or removed, 
if necessary; and (ii) unique RNA sequences in the labeled RNA sequences and unique RNA sequences in the 
reference genome are retained; (c) comparing the intensities of the signals from those labeled RNA sequences, 

so if any, which are hybridized to the reference genome, to determine relative copy number of differently labeled RNA 

sequences hybridized to the same position in the reference genome. 

29. The method of claim 28, wherein the reference genome comprises at least one metaphase chromosome. 

55 30. The method of claim 28, wherein the RNA sequences from at least one of said first cell or cell population and said 
second cell or cell population are isolated from tumor cells. 

31 . The method of claim 28, wherein the RNA sequences from at least one of said first cell or cell population and said 
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second eel! or celt population are isolated from fetal cells. 

32. The method of claim 28, wherein the RNA is messenger RNA. 

5 33. The method of claim 28, wherein the reference genome is an antenna cell line. 

34. The method of claim 28, further comprising the step of amplifying the RNA sequences from at least one of said 
first cell or cell population and said second cell or cell population prior to said hybridizing step. 

10 35. The method of claim 28, wherein the labeled RNA is directly visualizable. 

36. The method of claim 28, further comprising rendering the bound, labeled RNA visualizable after said hybridization 
step. 

15 37. The method of claim 28, wherein blocking of the repetitive sequences is carried out by including unlabeled copies 
of the repetitive sequences with said labeled RNA sequences. 

38. The method according to claim 28, wherein at least one of said first cell or cell population and said second cell or 
cell population is derived from a clinical specimen. 

20 

39. The method according to claim 28, wherein the hybridization is done in situ, the reference genome comprises 
reference metaphase chrcjxiosomes, and the intensities of signals from labeled nucleic acid sequences are com- 
pared as a function of position in the reference genome. 

25 40. The method according to claim 28, wherein the labeled RNA sequences are hybridized to a portion of the reference 
genome. 

41 . The method according to claim 28, wherein said first cell or cell population and said second cell or cell population 
and said reference genome are from the same species. 

30 

42. The method according to claim 28, wherein the reference genome is human. 

43. The method according to claim 28, wherein said labeled RNA sequences are labeled with fluoresces, ligands, 
chemiluminescers, enzyme substrates, enzyme cofactors, particles or dyes. 

35 

44. The method according to claim 28, which method further comprises extracting the RNA from said first cell or cell 
population and/or said second cell or cell population from formalin-fixed and/or paraffin-embedded archived tissue 
specimens prior to labelling. 

40 45. The method according to claim 28, wherein the step of comparing the intensities of the signals from the labeled 
RNA sequences comprises determining the ratio of the intensities of the signals as a function of position in the 
reference genome ; said method further comprising quantitatively comparing the intensity ratio among different 
locations along the reference genome, said ratio at each location being proportional to the ratio of the copy number 
of the nucleic acid sequence that binds to that location in the first cell or cell population to the copy number of a 

45 substantially identical sequence in the second cell or cell population, 

46. The method according to claim 28, wherein one of the cells or cell populations is the test cell or cell population 
and the other is a normal cell or cell population. 

so 47. The method according to claim 28, further comprising employing in situ hybridization or Northern analysts to de- 
termine absolute copy number of unique RNA sequences throughout the genome. 

48. The method according to claim 28, wherein the copy number of unique RNA sequences in more than two cells or 
cell populations is compared. 

55 

49. The method according to claim 28, wherein the RNA sequences from each cell or cell population are labeled with 
a different fluorochrome label, and the fluorochrome labeled RNA sequences from each cell or cell population are 
hybridized to the reference genome in situ. 
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50. The method of claim 49, wherein the reference genome comprises metaphase chromosomes. 

51 . The method according to claim 49, wherein the reference genome comprises reference metaphase chromosomes, 
and the intensities of signals from labeled RNA sequences are compared as a function of position in the reference 
genome. 

52. The method according to claim 49, wherein the step of comparing the intensities of the signals from the labeled 
RNA sequences comprises determining the ratio of the intensities of the signals as a function of position in the 
reference genome. 

53. The method according to claim 52, further comprising quantitatively comparing the intensity ratio among different 
locations in the reference genome, said ratio at each location being proportional to the ratio of the copy number 
of the RNA sequence that binds to that location in the first cell or cell population to the copy number of a substantially 
identical RNA sequence in the second cell or cell population. 

54. The method according to claim 49, wherein the copy number of unique RNA sequences in more than two cells or 
cell populations is compared. 

55. The method according to claim 25, additionally comprising: 

(d) determining the copy number of a calibration sequence in said first and second cells or cell populations, 
said calibration sequence being substantially identical to a unique sequence in the reference genome; and 

(e) normalizing the ratios determined in claim 25 so that the ratio at the calibration position in the reference 
genome is equal to the ratio of the copy numbers determined in step (d), the normalized ratio at any other 
location in the reference genome thereby giving the ratio of the copy numbers of the DNA sequences in the 
labeled nucleic acids that hybridize to that location. 

56. The method according to claim 18, additionally comprising: 

(d) determining the copy number of a calibration sequence in said first and second cells or cell populations, 
said calibration sequence being substantially identical to a unique sequence in the reference genome; and 

(e) normalizing the ratios determined in claim 18 so that the ratio at the calibration position in the reference 
genome is equal to the ratio of the copy numbers determined in step (d), the normalized ratio at any other 
location in the reference genome thereby giving the ratio of the copy numbers of the DNA sequences in the 
labeled nucleic acids that hybridize to that location, 

wherein said step of quantitatively comparing the intensity ratio among different positions along the reference 
genome comprises the comparison of normalized ratios, determined in step (e). 

57. The method according to claim 52, additionally comprising: 

(d) determining the copy number of a calibration sequence in said first and second cells or cell populations, 
said calibration sequence being substantially identical to a unique sequence in the reference genome; and 

(e) normalizing the ratios determined in claim 52 so that the ratio at the calibration position in the reference 
genome is equal to the ratio of the copy numbers determined in step (d), the normalized ratio at any other 
location in the reference genome thereby giving the ratio of the copy numbers of the RNA sequences in the 
labeled nucleic acids that hybridize to that location. 

58. The method according to claim 45, additionally comprising: 

(d) determining the copy number of a calibration sequence in said first and second cells or cell populations, 
said calibration sequence being substantially identical to a unique sequence in the reference genome; and 

(e) normalizing the ratios determined in claim 45 so that the ratio at the calibration position in the reference 
genome is equal to the ratio of the copy numbers determined in step (d), the normalized ratio at any other 
location in the reference genome thereby giving the ratio of the copy numbers of the RNA sequences in the 
labeled nucleic acids that hybridize to that location, 

wherein said step of quantitatively comparing the intensity ratio among different positions along the reference 
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Patentanspriiche 

5 

1 . Methode zum Vergleichen von Kopienanzahlen einmaliger DNA-Sequenzen in einer ersten Zelie oder Zellpopu- 
lation im Verhaitnis zu Kopienanzahlen von im Wesentlichen identischen Sequenzen in einer zweiten Zelle oder 
Zetlpopulation, wobei die Methode die nachstehenden Schritte einschlieBt: (a) Markieren von DNA-Sequenzen 
aus jeder Zelle oder Zellpopulation mit einer unterschiedlichen Markierung; (b) Hybridisieren der markierten DNA- 

10 Sequenzen aus jeder Zelle oder Zellpopulation mit einem Bezugsgenom, und zwar unter den folgenden Bedin- 

gungen: (i) die markierten DNA-Sequenzen und/oder das Bezugsgenom weisen ihre Wiederholungssequenzen 
blockiert und/oder entfernt auf, falls erforderiich, und (ii) einmalige DNA-Sequenzen in den markierten DNA-Se- 
quenzen und einmalige DNA-Sequenzen im Bezugsgenom werden zuruckbehalten; (c) Vergleichen der Intensi- 
taten der Signale von jenen markierten DNA-Sequenzen - falls es wefche gibt -, die mit dem Bezugsgenom hybri- 
ds disiert werden, urn die relative Kopienanzahl unterschiedlich markierter, mit der gleichen Position im Bezugsgenom 
hybridisierter DNA-Sequenzen zu bestimmen. 

2. Methode nach Anspruch 1 , wobei das Bezugsgenom mindestens ein Metaphasen-Chromosom enthalt. 

20 3. Methode nach Anspruch 1 , wobei die DNA-Sequenzen aus mindestens einer der ersten Zelle oder Zellpopulation 
und der zweiten Zelte oder Zellpopulation aus Tumorzellen isoliert sind. 

4. Methode nach Anspruch 1 , wobei die DNA-Sequenzen aus mindestens einer der ersten Zelle oder Zellpopulation 
und der zweiten Zelle oder Zellpopulation aus fetalen Zellen isoliert sind. 

25 

5. Methode nach Anspruch 1 , wobei die DNA chromosomale DNA oder cDNA ist. 

6. Methode nach Anspruch 1, wobei das Bezugsgenom eine Antenna-Zelllinie ist. 

30 7. Methode nach Anspruch 1 , ferner einschlieBend den Schritt eines Amplifizierens der DNA-Sequenzen aus min- 
destens einer der ersten Zelle oder Zellpopulation und der zweiten Zelle oder Zellpopulation vor dem Hybridisie- 
rungsschritt. 

8. Methode nach Anspruch 1, wobei die markierte DNA direkt sichtbar gemacht werden kann. 

35 

9. Methode nach Anspruch 1 1 ferner einschlieSend ein Visualisierbarmachen der gebundenen, markierten DNA nach 
dem Hybridisierungsschritt. 

10. Methode nach Anspruch 1, wobei ein Blockieren der Wiederholungssequenzen durch EinschlteBen unmarkierter 
40 Kopien der Wiederholungssequenzen mit den markierten DNA-Sequenzen durchgefuhrt wird. 

1 1 . Methode nach Anspruch 1 , wobei mindestens eine der ersten Zelle oder Zellpopulation und der zweiten Zelle oder 
Zellpopulation von einer klinischen Probe stammt. 

45 12. Methode nach Anspruch 1 , wobei die Hybridisierung in situ erfolgt, das Bezugsgenom Bezugs-Metaphasen-Chro- 
mosomen enthalt und die Intensitaten von Signalen von markierten Nucleinsauresequenzen in Abhangigkeit der 
Position im Bezugsgenom verglichen werden. 

13. Methode nach Anspruch 1, wobei die markierten DNA-Sequenzen mit einem Teil des Bezugsgenoms hybridisiert 
so werden. 

14. Methode nach Anspruch 1 , wobei die erste Zelle oder Zellpopulation und die zweite Zelle oder Zellpopulation und 
das Bezugsgenom von derselben Spezies sind. 

55 15. Methode nach Anspruch 1, wobei das Bezugsgenom menschltch ist. 

16. Methode nach Anspruch 1, wobei die markierten Nucleinsauresequenzen mit Fluoreszierern, Liganden, Chemi- 
lumineszierem : Enzymsubstraten, Enzymcofaktoren, Teilchen oder Farbstoffen markiert sind. 
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17. Methode nach Anspruch 1, welche Methode ferner vor dem Markieren ein Extrahieren der DNA aus der ersten 
Zelle Oder Zellpopulation und/oder der zweiten Zelle oder Zellpopulation von mit Formalin fixierten und/oder in 
Paraffin eingeschlossenen archivierten Gewebeproben einschlieBt. 

18. Methode nach Anspruch 1, wobei der Schritt des Vergleichens der Intensitaten der Signale von den markierten 
DNA-Sequenzen ein Bestimmen des Verhaltnisses der Intensitaten der Signale in Abhangigkeit der Position im 
Bezugsgenom einschlieBt wobei die Methode ferner ein quantitatives Vergleichen des Intensitatsverhaltnisses 
unter unterschiedlichen Platzen entlang des Bezugsgenoms einschlieBt, wobei das Verhaltnis an jedem Platz 
proportional zum Verhaltnis der Kopienanzahl der Nucleinsauresequenz, die sich an diesen Platz bindet, in der 
ersten Zelle oder Zellpopulation zur Kopienanzahl einer im Wesentlichen identischen Sequenz in der zweiten Zelle 
oder Zellpopulation ist. 

19. Methode nach Anspruch 1 , wobei eine der Zellen oder Zellpopulationen eine Testzelle oder -zellpopulation ist und 
die andere eine normale Zelle oder Zellpopulation ist. 

20. Methode nach Anspruch 1, ferner einschlieBend ein Einsetzen einer in srtu-Hybridisierung oder einer Southern- 
Analyse, urn die absolute Kopienanzahl einmaliger DNA-Sequenzen im gesamten Genom zu bestimmen. 

21. Methode nach Anspruch 1, wobei die Kopienanzahl einmaliger DNA-Sequenzen in mehr als zwei Zellen oder 
Zellpopulationen verglichen wird. 

22. Methode nach Anspruch 1„ wobei die DNA-Sequenzen aus jeder Zelle oder Zellpopulation mit einer unterschied- 
lichen Fluorochrom-Markierung markiert werden und die Fluorochrom-markierten DNA-Sequenzen aus jeder Zelle 
oder Zellpopulation in situ mit dem Bezugsgenom hybridisiert werden. 

23. Methode nach Anspruch 22, wobei das Bezugsgenom Metaphasen-Chromsomen enthalt. 

24. Methode nach Anspruch 22 : wobei das Bezugsgenom Bezugs-Metaphasen-Chromosomen enthalt und die Inten- 
sitaten von Signalen von markierten DNA-Sequenzen in Abhangigkeit der Position im Bezugsgenom verglichen 
werden. 

25. Methode nach Anspruch 22, wobei der Schritt des Vergleichens der Intensitaten der Signale von den markierten 
DNA-Sequenzen ein Bestimmen des Verhaltnisses der Intensitaten der Signale in Abhangigkeit der Position im 
Bezugsgenom einschiieBt. 

26. Methode nach Anspruch 25 : ferner einschlieftendein quantitatives Vergleichen des Intensitatsverhaltnisses unter 
unterschiedlichen Platzen entlang des Bezugsgenoms, wobei das Verhaltnis an jedem Platz proportional zum 
Verhaltnis der Kopienanzahl der DNA-Sequenz, die sich an diesen Platz bindet, in der ersten Zelle oder Zellpo- 
pulation zur Kopienanzahl einer im Wesentlichen identischen DNA-Sequenz in der zweiten Zelle oder Zellpopu- 
lation ist. 

27. Methode nach Anspruch 22, wobei die Kopienanzahl einmaliger DNA-Sequenzen in mehr als zwei Zellen oder 
Zellpopulationen verglichen wird. 

28. Methode zum Vergleichen von Kopienanzahlen einmaliger RNA-Sequenzen in einer ersten Zelle oder Zellpopu- 
lation irn Verhaltnis zu Kopienanzahlen von im Wesentlichen identischen Sequenzen in einer zweiten Zelle Oder 
Zellpopulation, wobei die Methode die nachstehenden Schritte einschlieBt: (a) Markieren von RNA-Sequenzen 
aus jeder Zelle oder Zellpopulation mit einer unterschiedlichen Markierung; (b) Hybridisieren der markierten RNA- 
Sequenzen aus jeder Zelle oder Zellpopulation mit einem Bezugsgenom, und zwar unter den folgenden Bedin- 
gungen: (i) die markierten RNA-Sequenzen und/oder das Bezugsgenom weisen ihre Wiederholungssequenzen 
btockiert und/oder entfernt auf, falls erforderlich : und (ii) einmalige RNA-Sequenzen in den markierten RNA-Se- 
quenzen und einmalige RNA-Sequenzen im Bezugsgenom werden zuruckbehalten; (c) Vergleichen der Intensi- 
taten der Signale von jenen markierten RNA-Sequenzen - falls es welche gibt - : die mit dem Bezugsgenom hybri- 
disiert werden, urn die relative Kopienanzahl unterschiedlich markierter, mit der gleichen Position im Bezugsgenom 
hybridisierter RNA-Sequenzen zu bestimmen. 

29. Methode nach Anspruch 28, wobei das Bezugsgenom mindestens ein Metaphasen-Chromosom enthalt. 
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30. Methode nach Anspruch 28, wobei die RNA-Sequenzen aus mindestens einer der ersten Zelle oder Zellpopulation 
und der zweiten Zelle oder Zellpopulation aus Tumorzellen isoliert sind. 

31 . Methode nach Anspruch 28, wobei die RNA-Sequenzen aus mindestens einer der ersten Zelle oder Zellpopulation 
5 und der zweiten Zelle oder Zellpopulation aus fetalen Zellen isoliert sind. 

32. Methode nach Anspruch 28, wobei die RNA Boten-RNA ist. 

33. Methode nach Anspruch 28, wobei das Bezugsgenom eine Antenna-Zelllinie ist. 

10 

34. Methode nach Anspruch 28, ferner einschlieBend den Schritt eines Amplifizierens der RNA-Sequenzen aus min- 
destens einer der ersten Zelle oder Zellpopulation und der zweiten Zelle oder Zellpopulation vor dem Hybridisie- 
rungsschritt. 

15 35. Methode nach Anspruch 28, wobei die markierte RNA direkt sichtbar gemacht werden kann. 

36. Methode nach Anspruch 28 : ferner einschlieBend ein Visualtsierbarmachen der gebundenen, markierten RNA 
nach dem Hybridisierungsschritt. 

20 37. Methode nach Anspruch 28, wobei ein Blockieren der Wiederholungssequenzen durch EinschlieBen unmarkierter 
Kopien der Wiederholungssequenzen mit den markierten RNA-Sequenzen durchgefuhrt wird. 

38. Methode nach Anspruch 28, wobei mindestens eine der ersten Zelle oder Zellpopulation und der zweiten Zelle 
oder Zellpopulation von einer klinischen Probe stammt. 

25 

39. Methode nach Anspruch 28, wobei die Hybridisierung in situ erfolgt, das Bezugsgenom Bezugs-Metaphasen- 
Chromosomen enthalt und die Intensitaten von Signalen von markierten Nucleinsauresequenzen in Abhangigkeit 
der Position im Bezugsgenom verglichen werden. 

30 40. Methode nach Anspruch 28, wobei die markierten RNA-Sequenzen mit einem Teil des Bezugsgenoms hybridisiert 
werden. 

41. Methode nach Anspruch 28, wobei die erste Zelle oder Zellpopulation und die zweite Zelle oder Zellpopulation 
und das Bezugsgenom von derselben Spezies sind. 

35 

42. Methode nach Anspruch 28, wobei das Bezugsgenom menschlich ist. 

43. Methode nach Anspruch 28 : wobei die markierten RNA-Sequenzen mit Fluoresziere'rn, Liganden, Chemilumines- 
zierern, Enzymsubstraten, Enzymcofaktoren, Teilchen oder Farbstoffen markiert sind. 

40 

44. Methode nach Anspruch 28, welche Methode ferner vor dem Markieren ein Extrahieren der RNA aus der ersten 
Zelle oder Zellpopulation und/oder der zweiten Zelle oder Zellpopulation von mit Formalin fixierten und/oder in 
Paraffin eingeschlossenen archivierten Gewebeproben einschlieBt. 

45 45. Methode nach Anspruch 28, wobei der Schritt des Vergleichens der Intensitaten der Signale von den markierten 
RNA-Sequenzen ein Bestimmen des Verhaltnisses der Intensitaten der Signale in Abhangigkeit der Position im 
Bezugsgenom einschlieBt, wobei die Methode ferner ein quantitatives Vergleichen des Intensitatsverhattnisses 
unter unterschiedlichen Platzen entlang des Bezugsgenoms einschlieBt, wobei das Verhaltnis an jedem Platz 
proportional zum Verhaltnis der Kopienanzahl der Nucleinsauresequenz, die sich an diesen Platz bindet, in der 

so ersten Zelle oder Zellpopulation zur Kopienanzahl einer im Wesentlichen identischen Sequenz in der zweiten Zelle 

oder Zellpopulation ist. 

46. Methode nach Anspruch 28, wobei eine der Zellen oder Zellpopulationen die Testzelle oder -zellpopulation ist und 
die andere eine normale Zelle oder Zellpopulation ist. 

55 

47. Methode nach Anspruch 28, ferner einschlieBend ein Einsetzen einer in srtu-Hybridisierung oder einer Northern- 
Analyse, urn die absolute Kopienanzahl einmaliger RNA-Sequenzen im gesamten Genom zu bestimmen. 
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48. Methode nach Anspruch 28, wobei die Kopienanzahl einmaliger RNA-Sequenzen in mehr als zwei Zellen oder 
Zellpopulationen verglichen wird. 

49. Methode nach Anspruch 28, wobei die RNA-Sequenzen aus jeder Zelle oder Zellpopulation mit einer unterschied- 
5 lichen Fluorochrom-Markierung markierl werden und die Fluorochrom-markierten RNA-Sequenzen aus jeder Zelle 

oder Zellpopulation in situ mit dem Bezugsgenom hybridisiert werden. 

50. Methode nach Anspruch 49, wobei das Bezugsgenom Metaphasen-Chromsomen enthalt. 

io 51 . Methode nach Anspruch 49 t wobei das Bezugsgenom Bezugs-Metaphasen-Chromosomen enthalt und die Inten- 
sitaten von Signalen von markierten RNA-Sequenzen in Abhangigkeit der Position im Bezugsgenom verglichen 
werden. 

52. Methode nach Anspruch 49, wobei der Schritt des Vergleichens der Intensitaten der Signaie von den markierten 
'5 RNA-Sequenzen ein Bestimmen des Verhaltnisses der Intensitaten der Signate in Abhangigkeit der Position im 

Bezugsgenom einschlieBt. 

53. Methode nach Anspruch 52, ferner einschlieBend ein quantitatives Vergleichen des Intensitatsverhaltnisses unter 
unterschiedlichen Platzen entlang des Bezugsgenoms, wobei das Verhaltnis an jedem Platz proportional zum 

20 Verhaltnis der Kopienanzahl der RNA-Sequenz, die sich an diesen Platz bindet, in der ersten Zelle oder Zellpo- 

pulation zur Kopienanzahl einer im Wesentlichen identischen RNA-Sequenz tn der zweiten Zelle oder Zellpopu- 
lation ist. 

54. Methode nach Anspruch 49, wobei die Kopienanzahl einmaliger RNA-Sequenzen in mehr als zwei Zellen oder 
25 Zellpopulationen verglichen wird. 

55. Methode nach Anspruch 25, zusatzlich einschlieBend: 

(d) Bestimmen der Kopienanzahl einer Kalibrierungssequenz in den ersten und zweiten Zellen oderZellpo- 
30 pulationen, wobei die Kalibrierungssequenz im Wesentlichen identisch mit einer einmaligen Sequenz im Be- 
zugsgenom ist, und 

(e) Normalisieren der in Anspruch 25 bestimmten Verhaltnisse, so dass das Verhaltnis an der Kalibrierungs- 
position im Bezugsgenom gleich dem Verhaltnis der im Schritt (d) bestimmten Kopienanzahlen ist, wobei das 
normalisierte Verhaltnis an jedem anderen Platz im Bezugsgenom dadurch das Verhaltnis der Kopienanzahlen 

35 der DNA-Sequenzen in den markierten Nucleinsauren, die mit diesem Platz hybridisieren, angibt. 

56. Methode nach Anspruch 18, zusatzlich einschlieBend: 

(d) Bestimmen der Kopienanzahl einer Kalibrierungssequenz in den ersten und zweiten Zellen oder Zellpo- 
pulationen, wobei die Kalibrierungssequenz im Wesentlichen identisch mit einer einmaligen Sequenz im Be- 
zugsgenom ist, und 

(e) Normalisieren der in Anspruch 18 bestimmten Verhaltnisse, so dass das Verhaltnis an der Kalibrierungs- 
position im Bezugsgenom gleich dem Verhaltnis der im Schritt (d) bestimmten Kopienanzahlen ist, wobei das 
normalisierte Verhaltnis an jedem anderen Platz im Bezugsgenom dadurch das Verhaltnis der Kopienanzahlen 
der DNA-Sequenzen in den markierten Nucleinsauren, die mit diesem Platz hybridisieren, angibt, 

wobei der Schritt des quantitativen Vergleichens des Intensitatsverhaltnisses unter unterschiedlichen Positionen 
entlang des Bezugsgenoms den Vergleich von im Schritt (e) bestimmten normalisierten Verhaltnissen einschlieBt. 

50 57. Methode nach Anspruch 52, zusatzlich einschlieBend: 

(d) Bestimmen der Kopienanzahl einer Kalibrierungssequenz in den ersten und zweiten Zellen oder Zellpo- 
pulationen, wobei die Kalibrierungssequenz im Wesentlichen identisch mit einer einmaligen Sequenz im Be- 
zugsgenom ist, und 

55 (e) Normalisieren der in Anspruch 52 bestimmten Verhaltnisse, so dass das Verhaltnis an der Kalibrierungs- 

position im Bezugsgenom gleich dem Verhaltnis der im Schritt (d) bestimmten Kopienanzahlen ist, wobei das 
normalisierte Verhaltnis an jedem anderen Platz im Bezugsgenom dadurch das Verhaltnis der Kopienanzahlen 
der RNA-Sequenzen in den markierten Nucleinsauren, die mit diesem Platz hybridisieren, angibt. 
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58. Methode nach Anspruch 45, zusatzlich einschlieBend: 

(d) Bestimmen der Kopienanzahl einer Kalibrierungssequenz in den ersten und zweiten Zelten oderZellpo- 
pulationen, wobei die Kalibrierungssequenz im Wesentlichen identisch mit einer einmaligen Sequenz im Be- 

zugsgenom ist, und . 

(e) Normalisieren der in Anspruch 45 bestimmten Verhaltnisse, so dass das Verhaltnis an der Kahbrierungs- 
position im Bezugsgenom gleich dem Verhaltnis der im Schritt (d) bestimmten Kopienanzahien ist, wobei das 
normalisierte Verhaltnis an jedem anderen Platz im Bezugsgenom dadurch das Verhaltnis der Kopienanzahien 
der RNA-Sequenzen in den markierten Nucleinsauren, die mit diesem Platz hybridisieren, angibt, 

wobei der Schritt des quantitativen Vergleichens des Intensitatsverhaltnisses unter unterschiediichen Positionen 
entiang des Bezugsgenoms den Vergleich von im Schritt (e) bestimmten normalisierten Verhaltnissen einschlieBt. 



Revendications 

1 Precede pour comparer !es nombres de copies de sequences d'ADN uniques dans une premiere cellule ou po- 
pulation de cellules, par rapport aux nombres de copies de sequences sensiblement identiques dans une seconde 
cellule ou population de cellules, ledit procede comprenant les etapes consistant a: (a) marquer les sequences 
d'ADN issues de chaque cellule ou population de cellules a I'aide d'un marqueur different; (b) hybrider lesdites 
sequences d'ADN marquees issues de chaque cellule ou population de cellules a un genome de reference dans 
les conditions suivantes: (i) les sequences d'ADN marquees et/ou le genome de reference ont leurs sequences 
repetitives bloquees et/ou retirees, si necessaire, et (ii) des sequences d'ADN uniques dans les sequences d'ADN 
marquees et des sequences d'ADN uniques dans le genome de reference sont maintenues; (c) comparer les 
intensities signaux provenant de ces sequences d'ADN marquees, s'il y a lieu, qui s'hybrident au genome de 
reference, pour determiner le nombre de copies relatif de sequences d'ADN marquees de facon differente, hybri- 
dees a la meme position dans le genome de reference. 

2. Procede de la revendication 1 , dans lequel le genome de reference comprend au moins un chromosome meta- 
phasique. 

3 Procede de la revendication 1 , dans lequel les sequences d'ADN de Tune au moins parmi ladite premiere cellule 
ou population de cellules et ladite seconde cellule ou population de cellules sont isolees de cellules tumorales. 

4 Procede de la revendication 1 , dans lequel les sequences d'ADN de I'une au moins parmi ladite premiere cellule 
ou population de cellules et ladite seconde cellule ou population de cellules sont isolees de cellules foetales. 

5. Procede de la revendication 1 , dans lequel i'ADN est un ADN chromosomique ou un ADNc. 

6. Procede de la revendication 1 , dans lequel le genome de reference est une lignee cellulaire antenne. 

7 Procede de la revendication 1 , comprenant en outre I'etape ^amplification des sequences d'ADN de I'une au 
moins parmi ladite premiere cellule ou population de cellules et ladite seconde cellule ou population de cellules 
avant ladite etape d'hybridation. 

8. Procede de la revendication 1 , dans lequel I'ADN marque est directement visualisable. 

9. Procede de la revendication 1, comprenant en outre I'etape consistant a rendre I'ADN marque lie visualisable 
apres ladite etape d'hybridation. 

' 10. Procede de la revendication 1 , dans lequel le blocage des sequences repetitives est realise en incorporant des 
copies non marquees des sequences repetitives avec lesdites sequences d'ADN marquees. 

1 1 . Procede selon la revendication 1 , dans lequel I'une au moins parmi ladite premiere cellule ou population de cellules 
et ladite seconde cellule ou population de cellules est derivee d'un echantillon ciinique. 

12 Procede selon la revendication 1. dans lequel ('hybridation s'effectue in situ, le genome de reference comprend 
des chromosomes metaphasiques de reference, et les intensites des signaux provenant des sequences d'acides 
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nucleiques marquees sont comparees en fonction de leur position dans le genome de reference. 

13. Procede selon la revendication 1, dans lequel les sequences d'ADN marquees sont hybridees a une partie du 
genome de reference. 

5 

14. Procede selon la revendication 1, dans lequel ladite premiere cellule ou population de cellules et ladite seconde 
cellule ou population de cellules ainsi que ledit genome de reference sont issus de la meme espece. 

15. Procede selon la revendication 1 , dans lequel le genome de reference est le genome humain. 

10 

16. Procede selon la revendication 1, dans lequel lesdites sequences d'acides nucleiques marquees sont marquees 
avec des agents fluorescents, des ligands, des agents chimioluminescents, des substrats enzymatiques, des co- 
facteurs d'enzymes. des particules ou des colorants. 

15 1 7. Procede selon la revendication 1 , lequel procede comprend en outre Pextractton de I'ADN de ladite premiere cellule 
ou population de cellules et/ou de ladite seconde cellule ou population de cellules a partir d'echantillons tissulaires 
archives, fixes dans du formol et/ou inclus dans la paraffine, avant le marquage. 

Procede selon la revendication 1 1 dans lequel I'etape de comparison des intensites des signaux des sequences 
d'ADN marquees comprend la determination du rapport des intensites des signaux en fonction de la position dans 
le genome de reference, ledit procede comprenant en outre la comparaison quantitative du rapport d'intensites 
parmi differentes localisations le long du genome de reference, ledit rapport au niveau de chaque localisation etant 
proportionnel au rapport du nombre de copies de la sequence d'acides nucleiques qui se fixe a cette localisation 
dans la premiere cellule ou population de cellules au nombre de copies d'une sequence sensiblement identique 
dans la seconde cellule ou population de cellules. 

1 9. Procede selon la revendication 1 1 dans lequel Tune des cellules ou I'une des populations de cellules est une cellule 
ou population de cellules d'essai et I'autre est une cellule normale ou une population de cellules normales. 

30 20. Procede selon la revendication 1, comprenant en outre I'emploi d'une hybridation in situ ou d'une analyse de 
Southern pour determiner le nombre absolu de copies de sequences d'ADN uniques dans tout le genome. 

21. Procede selon la revendication 1, dans lequel on compare le nombre de copies de sequences d'ADN uniques 
dans plus de deux cellules ou de deux populations de cellules. 

35 

22. Procede selon la revendication 1 , dans lequel les sequences d'ADN de chaque cellule ou population de cellules 
sont marquees a I'aide d'un marqueur fluorochrome different, et les sequences d'ADN marquees au fluorochrome 
de chaque cellule ou population de cellules sont hybridees in situ au genome de reference. 

40 23. Procede de la revendication 22, dans lequel le genome de reference comprend des chromosomes metaphasiques. 

24. Procede selon la revendication 22, dans lequel le genome de reference comprend des chromosomes metapha- 
siques de reference, et les intensites des signaux provenant des sequences d'ADN marquees sont comparees en 
fonction de leur position dans le genome de reference. 

45 

25. Procede selon la revendication 22, dans lequel I'etape de comparaison des intensites des signaux provenant des 
sequences d'ADN marquees comprend la determination du rapport des intensites des signaux en fonction de leur 
position dans le genome de reference. 

50 26. Procede selon la revendication 25, comprenant en outre la comparaison quantitative du rapport d'intensites parmi 
differentes localisations le long du genome de reference, (edit rapport au niveau de chaque localisation etant 
proportionnel au rapport du nombre de copies de la sequence d'ADN qui se fixe a cette localisation dans la prem iere 
cellule ou population de cellules au nombre de copies d'une sequence d'ADN sensiblement identique dans la 
seconde cellule ou population de cellules. 

55 

27. Procede selon la revendication 22 ; dans lequel on compare le nombre de copies de sequences d'ADN uniques 
dans plus de deux cellules ou de deux populations de cellules. 
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28. Procede pour comparer les nombres de copies de sequences d'ARN uniques dans une premiere cellule ou po- 
pulation de cellules, par rapport aux nombres de copies de sequences sensiblement identiques dans une seconde 
cellule ou population de cellules, ledit procede comprenant les etapes consistant a: (a) marquer les sequences 
d'ARN issues de chaque cellule ou population de cellules a ('aide d'un marqueur different; (b) hybrider lesdites 
sequences d'ARN marquees issues de chaque cellule ou population de cellules a un genome de reference dans 
les conditions suivantes: (i) les sequences d'ARN marquees, et/ou le genome de reference ont leurs sequences 
repetitives bloquees et/ou retirees, si necessaire, et (ii) des sequences d'ARN uniques dans les sequences d'ARN 
marquees et des sequences d'ARN uniques dans le genome de reference sont maintenues; (c) comparer les 
intensites des signaux provenant de ces sequences d'ARN marquees, s'il y a lieu, qui s'hybrident au genome de 
reference, pour determiner le nombre de copies relatif de sequences d'ARN marquees de facon differente, hybri- 
dees a la meme position dans le genome de reference. 

29. Procede de la revendication 28, dans lequel le genome de reference comprend au moins un chromosome meta- 
phasique. 

30. Procede de la revendication 28, dans lequel les sequences d'ARN de I'une au moins parmi ladite premiere cellule 
ou population de cellules et ladite seconde cellule ou population de cellules sont isolees de cellules tumorales. 

31. Procede de la revendication 28, dans lequel les sequences d'ARN de I'une au moins parmi ladite premiere cellule 
ou population de cellules et ladite seconde cellule ou population de cellules sont isolees de cellules foetales. 

32. Procede de la revendication 28, dans lequel TARN est un ARN messager. 

33. Procede de la revendication 28, dans lequel le genome de reference est une lignee celtulaire antenne. 

34. Procede de la revendication 28, comprenant en outre I'etape d'amplification des sequences d'ARN de I'une au 
moins parmi ladite premiere cellule ou population de cellules et ladite seconde cellule ou population de cellules 
avant ladite etape d'hybridation. 

35. Procede de la revendication 28, dans lequel TARN marque est directement visualisable. 

36. Procede de la revendication 28, comprenant en outre I'etape consistant a rendre I'ARN marque lie visualisable 
apres ladite etape d'hybridation. 

37. Procede de la revendication 28, dans lequel le blocage des sequences repetitives est realise en incorporant des 
copies non marquees des sequences repetitives avec lesdites sequences d'ARN marquees. 

38. Procede seton la revendication 28, dans lequel I'une au moins parmi ladite premiere cellule ou population de 
cellules et ladite seconde cellule ou population de cellules est derivee d'un echantillon clinique. 

39. Procede selon la revendication 28, dans lequel I'hybridation s'effectue in situ, le genome de reference comprend 
des chromosomes metaphasiques de reference, et les intensites des signaux provenant des sequences d'acides 
nucleiques marquees sont comparees en fonction de la position dans le genome de reference. 

40. Procede selon la revendication 28, dans lequel les sequences d'ARN marquees sont hybridees a une partie du 
genome de reference. 

41 . Procede seton la revendication 28, dans lequel ladite premiere cellule ou population de cellules et ladite seconde 
cellule ou population de cellules ainsi que ledit genome de reference sont issus de la meme espece. 

42. Procede selon la revendication 28, dans lequel le genome de reference est le genome humain. 

43. Procede selon la revendication 28, dans lequel lesdites sequences d'ARN marquees sont marquees avec des 
agents fluorescents, des ligands, des agents chimioluminescents, des substrats enzymatiques ; des cofacteurs 
d'enzymes, des particules ou des colorants. 

44. Procede selon la revendication 28, lequel procede comprend en outre I'extraction de I'ARN de ladite premiere 
cellule ou population de cellules et/ou de ladite seconde cellule ou population de cellules a partir d'echantillons 
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tissulaires archives, fixes dans du formol et/ou inclus dans la paraffine, avant le marquage. 

45. Procede selon la revendication 28, dans lequel I'etape de comparaison des intensites des signaux des sequences 
d'ARN marquees comprend la determination du rapport des intensites des signaux en fonction de la position dans 

5 le genome de reference, ledit procede comprenant en outre la comparaison quantitative du rapport d'intensites 

parmi differentes localisations le long du genome de reference, ledit rapport au niveau de chaque localisation etant 
proportionnel au rapport du nombre de copies de la sequence d'acides nucleiques qui se fixe a cette localisation 
dans la premiere cellule ou population de cellules au nombre de copies d'une sequence sensiblement identique 
dans la seconde cellule ou population de cellules. 

10 

46. Procede selon la revendication 28, dans lequel Tune des cellules ou I'une des populations de cellules est une 
cellule ou population de cellules d'essai et ('autre est une cellule normale ou une population de cellules normales. 

47. Procede selon la revendication 28, comprenant en outre Pemploi d'une hybridation in situ ou d'une analyse de 
15 Northern pour determiner le nombre absolu de copies de sequences d'ADN uniques dans tout le genome. 

48. Procede selon la revendication 28, dans lequel on compare le nombre de copies de sequences d'ARN uniques 
dans plus de deux cellules ou de deux populations de cellules. 

20 49. Procede selon la revendication 28, dans lequel les sequences d'ARN de chaque cellule ou population de cellules 
sont marquees a I'aide d'un marqueurfluorochrome different, et les sequences d'ARN marquees au fluorochrome 
de chaque cellule ou population de cellules sont hybridees in situ au genome de reference. 

50. Procede de (a revendication 49, dans lequel le genome de reference comprend des chromosomes metaphasiques. 

25 

51. Procede selon la revendication 49, dans lequel le genome de reference comprend des chromosomes metapha- 
siques de reference, et les intensites des signaux provenant des sequences d'ARN marquees sont comparees en 
fonction de leur position dans le genome de reference. 

30 52. Procede selon la revendication 49, dans lequel I'etape de comparaison des intensites des signaux provenant des 
sequences d'ARN marquees comprend la determination du rapport des intensites des signaux en fonction de leur 
position dans le genome de reference. 

53. Procede selon la revendication 52, comprenant en outre la comparaison quantitative du rapport d'intensites parmi 
35 differentes localisations dans le genome de reference, ledit rapport au niveau de chaque localisation etant pro- 

portionnel au rapport du nombre de copies de la sequence d'ARN qui se fixe a cette localisation dans la premiere 
cellule ou population de cellules au nombre de copies d'une sequence d'ARN sensiblement identique dans la 
seconde cellule ou population de cellules. 

40 54. Procede selon la revendication 49 : dans lequel on compare le nombre de copies de sequences d'ARN uniques 
dans plus de deux cellules ou de deux populations de cellules. 

55. Procede selon la revendication 25, comprenant en outre : 

45 (d) la determination du nombre de copies d'une sequence d'etalonnage dans lesdites premieres et secondes 

cellules ou populations de cellules, ladite sequence d'etalonnage etant sensiblement identique a une sequence 
unique dans le genome de reference; et 

(e) la normalisation des rapports determines dans la revendication 25 de sorte que le rapport au niveau de la 
position d'etalonnage dans le genome de reference soit egal au rapport des nombres de copies determines 
so dans I'etape (d), le rapport normalise a n'importe quelle autre localisation dans le genome de reference donnant 

ainsi le rapport des nombres de copies des sequences d'ADN dans les acides nucleiques marques qui s'hy- 
brident a cette localisation. 

56. Procede selon la revendication 18, comprenant en outre: 

55 

(d) !a determination du nombre de copies d'une sequence d'etalonnage dans lesdites premieres et secondes 
cellules ou populations de cellules, ladite sequence d'etalonnage etant sensiblement identique a une sequence 
unique dans le genome de reference; et 



41 



EP0 631 635 B1 



(e) la normalisation des rapports determines dans la revendication 18 de sorte que le rapport au niveau de la, 
position d'etalonnage dans le genome de reference soit egal au rapport des nombres de copies determines 
dans I'etape (d), le rapport normalise a n'importe quelle autre localisation dans le genome de reference donnant 
ainsi le rapport des nombres de copies des sequences dVVDN dans les acides nucleiques marques qui s'hy- 
brident a cette localisation, 

dans lequel ladite etape de comparaison quantitative du rapport d'intensites parmi differentes positions le long du 
genome de reference comprend la comparaison des rapports normalise's determines dans I'etape (e). 

57. Procede selon la revendication 52, comprenant en outre: 

.(d) la determination du nornbre de copies d'une sequence d'etalonnage dans lesdites premieres et secondes 
cellules ou populations de cellules, ladite sequence d'etalonnage etant sensiblement identique a une sequence 
unique dans le genome de reference; et 

(e) la normalisation des rapports determines dans la revendication 52 de sorte que le rapport au niveau de la 
position d'etalonnage dans le genome de reference soit egal au rapport des nombres de copies determines 
dans I'etape (d), le rapport normalise a n'importe quelle autre localisation dans le genome de reference donnant 
ainsi le rapport des nombres de copies des sequences d'ARN dans les acides nucleiques marques qui s'hy- 
brident a cette localisation. 



58. Procede selon la revendication 45, comprenant en outre: 

(d) la determination du nornbre de copies d'une sequence d'etalonnage dans lesdites premieres et secondes 
cellules ou populations de cellules, ladite sequence d'etalonnage etant sensiblement identique a une sequence 
unique dans le genome de reference; et 

(e) la normalisation des rapports determines dans la revendication 45 de sorte que le rapport au niveau de la 
position d'etalonnage dans le genome de reference soit egal au rapport des nombres de copies determines 
dans I'etape (d), le rapport normalise a n'importe quelle autre localisation dans le genome de reference donnant 
ainsi le rapport des nombres de copies des sequences d'ARN dans les acides nucleiques marques qui s'hy- 
brident a cette localisation, . 

dans lequel ladite etape de comparaison quantitative du rapport d'intensites parmi differentes positions le long du 
genome de reference comprend la comparaison des rapports normalises determines dans I'etape (e). 
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